Abstract
This paper describes document models and relations for the retrieval of document images. The underlying methodology was developed for the Intelligent Document Image Retrieval System (IDIR), which aims to extend document image database query capabilities. Traditional component type and keyword features are insufficient in describing logical and structural aspects of documents nature.
We have developed the necessary object-oriented document models to carry out complex multi domain retrieval scenarios. In this paper we focus on retrieval capabilities and underlying methodology that supports different schemes. For these models and query schemes (QS), new graphical techniques are introduced. The IDIR allows complex combinations of different QS's, using the extended concept of `frame logic', developed in our earlier work for the attribute management. Furthermore, a concept of document similarity is introduced with relations to QS's, document models, structure and role of use. Examples are shown for the retrieval of document images from University of Washington Database.
Preview
Unable to display preview. Download preview PDF.
References
Doermann D., Sauvola J., Kauniskangas H., Shin C., Pietikäinen M. and Rosenfeld A. (1997) The development of a general framework for intelligent document image retrieval. A book chapter in Document Analysis Systems II, Series in Machine Perception and Artificial Intelligence, 28 pages.
Information Retrieval: Data Structures and Algorithms, William B. Frakes, and Ricardo Baeza-Yates (Eds.), Prentice Hall, Englewood Cliffs, NJ, 1992.
Rao B.R. (1994) Object-oriented databases: technology, applications, and products. Database Experts' Series, McGraw-Hill, 253 pages.
E.G.M. Petrakis and C. Faloutsos. Similarity searching in large image databases. Technical Report CS-TR-3388, University of Maryland Institute for Advanced Computer Studies and Dept. of Computer Science, Univ. of Maryland, December 1994.
Gerald Salton, and Michael J. McGill, Introduction to Modern Information Retrieval, McGraw-Hill, New York, 1983.
Christian Shin, David Doermann, and Azriel Rosenfeld, Querying Document Image Databases using Structural Similarity, Technical Report, Center for Automation Research, University of Maryland at College Park (in preparation).
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sauvola, J., Doermann, D., Kauniskangas, H., Shin, C., Koivusaari, M., Pietikäinen, M. (1997). Graphical tools and techniques for querying document image databases. In: Murshed, N.A., Bortolozzi, F. (eds) Advances in Document Image Analysis. BSDIA 1997. Lecture Notes in Computer Science, vol 1339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63791-5_16
Download citation
DOI: https://doi.org/10.1007/3-540-63791-5_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63791-2
Online ISBN: 978-3-540-69646-9
eBook Packages: Springer Book Archive