Abstract
Many users and applications require the integration of semi-structured data from autonomous, heterogeneous Web sources. Over the last years mediator systems have emerged that use domain knowledge to overcome the problem of structural heterogeneity. However, many users of these systems do not have a thorough knowledge of the complex global schemas and of the comprehensive query languages. Consequently, easy-to-use query interfaces like keyword search and browsing have to be supported. The aim of the proposed PhD project is the index-based realization of keyword searches in concept-based mediator systems. In order to avoid unnecessary source queries an index structure is maintained on the global level and used during query planning and processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abiteboul, S.: Querying Semi-Structured Data. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 1–18. Springer, Heidelberg (1996)
Agrawal, S., Chaudhuri, S., Das, G.: DBXplorer: A System for Keyword-Based Search over Relational Databases. In: ICDE 2002, pp. 5–16 (2002)
Amann, B., Beeri, C., Fundulaki, I., Scholl, M.: Ontology-Based Integration of XML Web Resources. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 117–131. Springer, Heidelberg (2002)
Arens, Y., Chee, C.Y., Hsu, C.-N., Knoblock, C.A.: Retrieving and Integrating Data from Multiple Information Sources. International Journal on Intelligent and Cooperative Information Systems 2(2), 127–158 (1993)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Baru, C.K., Gupta, A., Ludäscher, B., Marciano, R., Papakonstantinou, Y., Velikhov, P., Chu, V.: XML-Based Information Mediation with MIX. In: SIGMOD 1999, pp. 597–599 (1999)
Bergmann, M.K.: The deep web: Surfacing hidden value (2003), http://www.brightplanet.com/deepcontent/tutorials/DeepWeb/
Bhalotia, G., Hulgeri, A., Nakhe, C., Chakrabarti, S., Sudarshan, S.: Keyword Searching and Browsing in Databases using Banks. In: ICDE 2002, pp. 431–440 (2002)
Callan, J.P., Connell, M.E.: Query-based sampling of text databases. Information Systems 19(2), 97–130 (2001)
Chakrabarti, S., van den Berg, M., Dom, B.: Focused Crawling: a new approach to topic specfic Web resource discovery. WWW8 / Computer Networks 31(11–16), 1623–1640 (1999)
Florescu, D., Kossmann, D., Manolescu, I.: Integrating Keyword Search into XML Query Processing. WWW9 / Computer Networks 33(1–6), 119–135 (2000)
Friedmann, A., Levy, A., Millstein, T.: Navigational Plans for Data Integration. In: AAAI/IAAI 1999, pp. 67–73 (1999)
Garcia-Molina, H., Papakonstantinou, Y., Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J.D., Vassalos, V., Widom, J.: The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems 8(2), 117–132 (1997)
Goh, C.H., Bressan, S., Madnick, S.E., Siegel, M.D.: Context Interchange: New Features and Formalisms for the Intelligent Integration of Information. ACM Transactions on Information Systems 17(3), 270–293 (1999)
Green, N., Ipeirotis, P.G., Gravano, L.: SDLIP + STARTS = SDARTS a protocol and toolkit for metasearching. In: ACM/IEEE Joint Conference on Digital Libraries, pp. 207–214 (2001)
Hristidis, V., Papakonstantinou Y.: Discover: Keyword Search in Relational Databases. In: VLDB 2002, pp. 670–681 (2002)
Ipeirotis, P.G., Gravano, L.: Distributed Search over the Hidden Web: Hirarchical Database Sampling and Selection. In: VLDB 2002 (2002)
Karnstedt, M., Sattler, K.-U., Geist, I., Höpfner, H.: Semantic Caching in Ontology-based Mediator Systems. In: Berliner XML Tage 2003, 3rd Int. Workshop Web und Datenbanken, pp. 155–169 (October 2003)
Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying Heterogeneous Information Sources Using Source Descriptions. In: VLDB 1996, pp. 251–262 (1996)
Ludäscher, B., Gupta, A., Martone, M.E.: Model-based Mediation with Domain Maps. In: ICDE 2001, pp. 82–90 (2001)
Magkanaraki, A., Karvounarakis, G., Anh, T.T., Christophides, V., Plexousakis, D.: Ontology Storage and Querying. Technical Report 308, Foundation for Research and Technology Hellas, Institute of Computer Science (April 2002)
Masermann, U., Vossen, G.: Design and Implementation of a Novel Approach to Keyword Searching in Relational Databases. In: Masunaga, Y., Thalheim, B., Štuller, J., Pokorný, J. (eds.) ADBIS 2000 and DASFAA 2000. LNCS, vol. 1884, pp. 171–184. Springer, Heidelberg (2000)
Sattler, K.-U., Geist, I., Schallehn, E.: Concept-based Querying in Mediator Systems. The VLDB Journal (to appear, 2004)
Sizov, S., Theobald, M., Siersdorfer, S., Weikum, G., Graupmann, J., Biwer, M., Zimmer, P.: The BINGO! System for Information Portal Generation and Expert Web Search. In: CIDR 2003 (2003)
Theobald, A., Weikum, G.: The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 477–495. Springer, Heidelberg (2002)
Wiederhold, G.: Mediators in the Architecture of Future Information Systems. IEEE Computer 25(3), 38–49 (1992)
Zipf, G.K.: Human Behavior and the Principle of Least Effort. Addison-Wesley, Reading (1949)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Geist, I. (2004). Index-Based Keyword Search in Mediator Systems. In: Lindner, W., Mesiti, M., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds) Current Trends in Database Technology - EDBT 2004 Workshops. EDBT 2004. Lecture Notes in Computer Science, vol 3268. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30192-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-30192-9_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23305-3
Online ISBN: 978-3-540-30192-9
eBook Packages: Computer ScienceComputer Science (R0)