Query Preprocessing for Integrated Search in Heterogeneous Data Sources

  • Conference paper
Datenbanksysteme in Büro, Technik und Wissenschaft

Part of the book series: Informatik aktuell (INFORMAT)

Abstract

SINGAPORE (SINGle Access POint for heterogeneous data REpositories) is a system for querying heterogeneous data. One of its distinguishing features is that new sources may be registered at runtime. For this reason it does not rely on a predefined global integrated schema; instead, users integrate data from the underlying sources when querying. Since formulating such queries can be demanding, the system accepts fuzzy queries, which are easier to write at the expense of possibly less exact results. As a consequence, input queries need special treatment, called query preprocessing, which generates complex target queries that effectively return the results for the initial user queries. In this paper we discuss the importance of query preprocessing in our system, present heuristics for implementing it, and show how techniques from database management systems and information retrieval can be combined in the process of query transformation.

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Domenig, R., Dittrich, K.R. (2001). Query Preprocessing for Integrated Search in Heterogeneous Data Sources. In: Heuer, A., Leymann, F., Priebe, D. (eds) Datenbanksysteme in Büro, Technik und Wissenschaft. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-56687-5_13

  • DOI: https://doi.org/10.1007/978-3-642-56687-5_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41707-1

  • Online ISBN: 978-3-642-56687-5

  • eBook Packages: Springer Book Archive
