Query Preprocessing for Integrated Search in Heterogeneous Data Sources

  • Conference paper
Datenbanksysteme in Büro, Technik und Wissenschaft

Part of the book series: Informatik aktuell (INFORMAT)

Abstract

SINGAPORE (SINGle Access POint for heterogeneous data REpositories) is a system for querying heterogeneous data. One of its distinguishing features is that new sources can be registered at runtime. For this reason it does not rely on a predefined, global integrated schema; instead, users integrate data from the underlying sources when they formulate a query. Because writing such queries can be demanding, our system also accepts fuzzy queries, which are easier to formulate but may produce less exact results. Input queries therefore need special treatment, called query preprocessing, which generates complex target queries that effectively return the results for the initial user queries. In this paper we discuss the importance of query preprocessing in our system, present heuristics for implementing it, and show how techniques from database management systems and information retrieval can be combined in the process of query transformation.
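
The abstract does not describe the preprocessing heuristics themselves, so the following Python sketch only illustrates the general idea and is not the SINGAPORE implementation. It assumes a hypothetical `SourceRegistry` to which sources can be added at runtime, and a hypothetical `preprocess` function that expands one fuzzy condition into exact conditions per source. The standard library's `difflib.SequenceMatcher` stands in for an information-retrieval style similarity measure; the threshold of 0.6 is an arbitrary choice for the example.

```python
from dataclasses import dataclass, field
from difflib import SequenceMatcher


@dataclass
class SourceRegistry:
    """Hypothetical registry: there is no fixed global schema, only the
    attribute names each registered source exposes."""
    schemas: dict[str, list[str]] = field(default_factory=dict)

    def register(self, source: str, attributes: list[str]) -> None:
        # Sources may be added at any time, including between queries.
        self.schemas[source] = list(attributes)


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def preprocess(registry: SourceRegistry, fuzzy_attr: str, value: str,
               threshold: float = 0.6) -> list[tuple[str, str, str, float]]:
    """Expand a fuzzy condition `fuzzy_attr = value` into exact
    (source, attribute, value, score) target conditions."""
    targets = []
    for source, attributes in registry.schemas.items():
        for attr in attributes:
            score = similarity(fuzzy_attr, attr)
            if score >= threshold:
                targets.append((source, attr, value, round(score, 2)))
    # Best-matching attributes first.
    return sorted(targets, key=lambda t: t[3], reverse=True)


registry = SourceRegistry()
registry.register("library_db", ["author", "title", "year"])
registry.register("web_pages", ["authors", "heading"])

for target in preprocess(registry, "author", "Dittrich"):
    print(target)
```

In this sketch the user names an attribute only approximately, and the preprocessing step decides which concrete attributes in which sources the condition should be sent to. A source registered after the first query is picked up by the next call to `preprocess` without any schema redefinition.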

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Domenig, R., Dittrich, K.R. (2001). Query Preprocessing for Integrated Search in Heterogeneous Data Sources. In: Heuer, A., Leymann, F., Priebe, D. (eds) Datenbanksysteme in Büro, Technik und Wissenschaft. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-56687-5_13

  • DOI: https://doi.org/10.1007/978-3-642-56687-5_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41707-1

  • Online ISBN: 978-3-642-56687-5

  • eBook Packages: Springer Book Archive
