Abstract
With the rapid development of e-services on the Web, an increasing number of e-catalogs are becoming accessible to users, and many of them provide information about similar types of products or services. To reduce users' information-searching effort, data integration systems have been developed to integrate e-catalogs that provide similar types of information, so that users can query those e-catalogs via a mediator through a uniform query interface. The conventional approach to answering a query received by a mediator is to select e-catalogs purely on the basis of their query capabilities, i.e., their query interface specifications. However, the fact that an e-catalog is capable of answering a query does not mean that it holds relevant answers to that query. To avoid wasting resources on querying catalogs that return no answers, in this paper we propose to use a catalog content summary as a filter and to select the e-catalogs relevant to a given query based not only on their query capabilities but also on the relevance of their content to the query. A multi-attribute content (MAC) summary is proposed to describe an e-catalog with respect to its content. With the MAC summary, an e-catalog is selected to answer a query only if it is likely to have answers to the query. The MAC summary can be constructed and updated using the answers returned from e-catalogs, and therefore the e-catalogs need not be cooperative. We evaluated the MAC summary on 50 e-catalogs, and the experimental results were promising.
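The two-stage selection described in the abstract (query capability first, then content relevance) can be illustrated with a minimal sketch. This is not the MAC summary itself, whose structure the abstract does not specify. It assumes a much simpler, hypothetical summary that records the attribute values seen in previously returned answers and supports equality queries only. The names `CatalogSummary`, `Catalog`, and `select_catalogs` are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class CatalogSummary:
    """Hypothetical content summary: values observed per attribute."""
    values: Dict[str, Set] = field(default_factory=dict)

    def update(self, answers: List[dict]) -> None:
        # Built only from returned answers, so the catalog itself
        # does not have to cooperate.
        for record in answers:
            for attr, value in record.items():
                self.values.setdefault(attr, set()).add(value)

    def may_answer(self, query: dict) -> bool:
        # Heuristic: reject only when the attribute has been observed
        # and the requested value has never been seen for it.
        for attr, wanted in query.items():
            seen = self.values.get(attr)
            if seen is not None and wanted not in seen:
                return False
        return True


@dataclass
class Catalog:
    name: str
    queryable: Set[str]  # attributes the query interface accepts
    summary: CatalogSummary = field(default_factory=CatalogSummary)


def select_catalogs(query: dict, catalogs: List[Catalog]) -> List[str]:
    """Keep catalogs that can accept the query AND are likely to answer it."""
    selected = []
    for catalog in catalogs:
        capable = set(query) <= catalog.queryable
        if capable and catalog.summary.may_answer(query):
            selected.append(catalog.name)
    return selected


laptops = Catalog("laptops", {"brand", "type"})
laptops.summary.update([{"brand": "Acme", "type": "laptop"}])
phones = Catalog("phones", {"brand", "type"})
phones.summary.update([{"brand": "Acme", "type": "phone"}])

# Both catalogs accept a query on "type", but only one has matching content.
print(select_catalogs({"type": "laptop"}, [laptops, phones]))  # ['laptops']
```

Because such a summary is built from sampled answers, it can wrongly exclude a catalog whose matching items have not yet been observed. A catalog with an empty summary is therefore kept in this sketch rather than filtered out.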
An erratum to this chapter can be found at http://dx.doi.org/10.1007/11914853_71.
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sun, A., Benatallah, B., Hacid, MS., Hassan, M. (2006). Querying E-Catalogs Using Content Summaries. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE. OTM 2006. Lecture Notes in Computer Science, vol 4275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11914853_8
DOI: https://doi.org/10.1007/11914853_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48287-1
Online ISBN: 978-3-540-48289-5
eBook Packages: Computer Science, Computer Science (R0)