DOI: 10.1145/1030397.1030431
Article

Digital capture for automated scanner workflows

Published: 28 October 2004

Abstract

Using scanners and other capture devices to bring film- and paper-based materials into digital workflows is an important part of "digital convergence": the merging of paper-based and electronic documents into the same electronic workflows. The diversity of captured content, from text and mixed-type documents to photos, negatives, slides and transparencies, requires a combination of document analysis techniques to segment, classify and assign scanned images to workflows automatically. We present technologies that provide fast (< 1.0 sec) and reliable (> 95% job accuracy) capture for all of these input content types. These solutions offer near-real-time capture and bring automated workflow capabilities to a range of scanning hardware: scanners, all-in-one devices, copiers and multifunctional printers. We describe the techniques used to categorize the documents, perform zoning analysis on them, and then apply closed-loop quality assurance.

Published In

DocEng '04: Proceedings of the 2004 ACM symposium on Document engineering
October 2004
252 pages
ISBN:1581139381
DOI:10.1145/1030397

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. classification
  2. negatives
  3. photos
  4. scanning
  5. segmentation
  6. slides
  7. user interface
  8. zoning

Qualifiers

  • Article

Conference

DocEng04: ACM Symposium on Document Engineering
October 28 - 30, 2004
Milwaukee, Wisconsin, USA

Acceptance Rates

Overall Acceptance Rate 194 of 564 submissions, 34%

Cited By

  • (2018) Document Representations (Inclusive Native and Relational). Encyclopedia of Database Systems, pp. 1241-1247. DOI: 10.1007/978-1-4614-8265-9_138. Online publication date: 7-Dec-2018
  • (2017) Document Representations (Inclusive Native and Relational). Encyclopedia of Database Systems, pp. 1-6. DOI: 10.1007/978-1-4899-7993-3_138-2. Online publication date: 2-Aug-2017
  • (2016) Efficient and accurate document image classification algorithms for low-end copy pipelines. EURASIP Journal on Image and Video Processing, 2016:1. DOI: 10.1186/s13640-016-0135-4. Online publication date: 7-Oct-2016
  • (2013) Application of Parallelism by Component. Meta‐Algorithmics, pp. 137-174. DOI: 10.1002/9781118626719.ch5. Online publication date: 27-May-2013
  • (2009) New Trends in Digital Scanning Processes. Proceedings of the 2009 10th International Conference on Document Analysis and Recognition, pp. 1071-1075. DOI: 10.1109/ICDAR.2009.76. Online publication date: 26-Jul-2009
  • (2008) Document page classification algorithms in low-end copy pipeline. Journal of Electronic Imaging, 17:4 (043011). DOI: 10.1117/1.3010879. Online publication date: 1-Oct-2008
  • (2007) A Document Page Classification Algorithm in Copy Pipeline. 2007 IEEE International Conference on Image Processing, pp. III-237 to III-240. DOI: 10.1109/ICIP.2007.4379290. Online publication date: Sep-2007
