Abstract
One of the first application domains for computer science was Optical Character Recognition. At that time, it was expected that a machine would quickly be able to read any document. History has proven that the task was more difficult than that. This chapter explores the history of the document analysis and recognition domain, from OCR to page analysis and on to the open problems which are still to be completely dealt with.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Many thanks to the authors of several chapters of this handbook for their contribution to the list of stubborn obstacles.
References
Schantz HF (1982) History of OCR, optical character recognition. Recognition Technologies Users Association, Manchester Center, Vt., USA
Rabinow J (1969) Whither OCR? And whence? Datamation 15(7):38–42
Mori S, Suen CY, Yamamoto K (1992) Historical review of OCR research and development. Proc IEEE 80(7):1029–1058
Nagy G (1992) At the frontiers of OCR. Proc IEEE 80(7):1093–1100
Mori S, Nishida H, Yamada H (1999) Optical character recognition. Wiley, New York
Sampson G (1985) Writing systems. Stanford University Press, Stanford
Ritchie G, Russell G, Black A, Pulman S (1992) Computational morphology. MIT, Cambridge
Wong KY, Casey RG, Wahl FM (1982) Document analysis system. IBM J Res Dev 26(6): 647–656
Wang D, Srihari SN (1989) Classification of newspaper image blocks using texture analysis. Comput Vis Graph Image Process 47:327–352
Nagy G, Seth S, Viswanathan M (1992) A prototype document image analysis system for technical journals. IEEE Comput Mag 25(7):10–22
Schürmann J (1978) A multifont word recognition system for postal address reading. IEEE Trans Comput 27(8):721–732
Kasturi R, Alemany J (1988) Information extraction from images of paper-based maps. IEEE Trans Softw Eng 14(5):671–675
Shimotsuji S, Hori O, Asano M, Suzuki K, Hoshino F, Ishii T (1992) A robust recognition system for a drawing superimposed on a map. IEEE Comput Mag 25(7):56–59
Groen F, Sanderson A, Schlag J (1985) Symbol recognition in electrical diagrams using probabilistic graph matching. Pattern Recognit Lett 3:343–350
Vaxivière P, Tombre K (1992) Celesstin: CAD conversion of mechanical drawings. IEEE Comput Mag 25(7):46–54
Rice S, Nagy G, Nartker T (1999) OCR: an illustrated guide to the frontier. Kluwer, Boston
Baird H (2007) The state of the art of document image degradation modeling. In: Chaudhuri B (ed) Digital document processing. Springer, London
Sellen A, Harper R (2003) The myth of the paperless office. MIT, Cambridge
Lesk M (1997) Practical digital libraries: books, bytes, & bucks. Morgan Kaufmann, San Francisco
Nunberg G (1996) The future of the book. University of California Press, Berkeley
Baird HS, Bunke H, Yamamoto K (eds) (1992) Structured document image analysis. Springer, Berlin/New York
Nagy G (2000) Twenty years of document image analysis in PAMI. IEEE Trans PAMI 22(1):38–62
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag London
About this entry
Cite this entry
Baird, H.S., Tombre, K. (2014). The Evolution of Document Image Analysis. In: Doermann, D., Tombre, K. (eds) Handbook of Document Image Processing and Recognition. Springer, London. https://doi.org/10.1007/978-0-85729-859-1_43
Download citation
DOI: https://doi.org/10.1007/978-0-85729-859-1_43
Published:
Publisher Name: Springer, London
Print ISBN: 978-0-85729-858-4
Online ISBN: 978-0-85729-859-1
eBook Packages: Computer ScienceReference Module Computer Science and Engineering