Abstract
Our work concerns three major processes which help to build a DL database for IR. Several components of the system are shown to be useful for creating a DL database. The design of these processes stresses the importance of robustness and ease of adaptation to the processing of different documents. Output generated by these processes facilitates the IR mechanism to produce intelligent response to user queries. An adaptive approach to document understanding was presented in this chapter. Its robustness was shown to be crucial to the success in processing varied library documents. This chapter also presented an adaptation of the vector space model for information retrieval to improving the performance of a word recognition algorithm. The neighborhoods of visually similar words determined by word recognition are matched to a database of documents and a subset of documents with topics that are similar to those of the input image are determined.
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Srihari, S.N., Lam, S.W., Hull, J.J. (1995). Document recognition for a Digital Library. In: Adam, N.R., Bhargava, B.K., Yesha, Y. (eds) Digital Libraries Current Issues. DL 1994. Lecture Notes in Computer Science, vol 916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026853
Download citation
DOI: https://doi.org/10.1007/BFb0026853
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-59282-2
Online ISBN: 978-3-540-49230-6
eBook Packages: Springer Book Archive