Abstract
Applications which require a combination of structured data with unstructured text fields are becoming of increasing practical interest. But whereas structured data are usually stored in a relational database, large text collections are maintained by proprietary text or information retrieval systems. The synthesis of both areas is still a topic of intensive research. We describe one such application, namely maintaining library catalogues, and study the efficiency of two implementation alternatives both based on RDBMS technology. In the first alternative word occurrence information is encoded using bitlists. The other chooses a direct implementation within the relational model. Performance tests are done which are based on real world data and real world user transactions. They demonstrate that the problem of the bitlist implementation is caused by conversions which are necessary to combine them with structured data. In contrast, our direct implementation benefits from today's sophisticated RDBMS technology and performs promisingly well.
Preview
Unable to display preview. Download preview PDF.
References
O. Balownew, T. Bode, A.B. Cremers, J. Kalinski, J.E. Wolff, and H. Rottmann. Maintaing Library Catalogues with an RDBMS — A Performance Study —. Technical Report IAI-TR-96-13, University of Bonn, November 1996.
S. DeFazio, A. Daoud, L.A. Smith, and J. Srinivasan. Integrating IR and RDBMS using cooperative indexing. In Proc. of the 18th Annual Int. SIGIR Conf. on Research and Development in Information Retrieval, pages 84–92, 1995.
Deutsche Forschungsgemeinschaft — Bibliotheksausschuß. Empfehlungen zur Migration der deutschen Bibliotheksverbiinde. ZfBB, 42(2):105–136, 1995.
W.B. Frakes and R. Baeza-Yates, editors. Information Retrieval — Data Structures and Algorithms. Prentice Hall, 1992.
Jürgen Freitag, Horst-Dieter Werner, and Wolfgang Wilkes. Strukturierte Attribute in Relationen zur Unterstützung von IR-Anwendungen. In GI 12. Jahrestagung, Informatik-Fachberichte 57, pages 623–647. Springer, 1982.
Graham Hoare. Oracle TextServer3 — Workbench C Guide. Oracle Corp., 500 Oracle Parkway, Redwood City, CA 94065, 1995. Version 1.0, Part No. A22190-1.
H. Kaufmann and H.-J. Schek. Text Search Using Database Systems Revisited — Some Experiments —. In C.A. Goble and J.A. Keane, editors, Proc. of the 13th British National Conference on Databases (BNCOD 13), pages 204–225. Springer, LNCS 940, 1995.
Ian A. Macleod. Text retrieval and the relational model. Journal of the American Society for Information Science, 42(3):155–165, 1991.
National Institute of Standards and Technology. Proceedings of the Fourth Text REtrieval Conference (TREC-4), Gaithersburg, Md., November 1995.
J. Newton and D.Y. Brenman. Oracle TextServer3 — Administrator's Guide. Oracle Corp., 500 Oracle Parkway, Redwood City, CA 94065, 1995. Version 3.0, Part No. A22191-1.
Hans-Jörg Schek. Methods for the administration of textual data in database systems. In R.N. Oddy, S.E. Robertson, C.J. van Rijsbergen, and P.W. Williams, editors, Information Retrieval Research, pages 218–235. Butterworths, 1981.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Balownew, O., Bode, T., Cremers, A.B., Kalinski, J., Wolff, J.E., Rottmann, H. (1997). A library application on top of an RDBMS: Performance aspects. In: Hameurlain, A., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 1997. Lecture Notes in Computer Science, vol 1308. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0022057
Download citation
DOI: https://doi.org/10.1007/BFb0022057
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63478-2
Online ISBN: 978-3-540-69580-6
eBook Packages: Springer Book Archive