ABSTRACT
Existing XML query algebras are not fully suited to retrieving RSS news items, for three main reasons: 1) RSS content is text rich and depends on the author's wording and choice of words, so semantic-aware operators are needed; 2) news items are dynamic, so time-oriented retrieval is needed; 3) a news item may evolve over time or overlap with other news items, so identifying relationships between items is also needed. In this paper, we address these issues with a dedicated RSS algebra based on semantic-aware operators that take RSS characteristics into account. The operators are application-domain specific and can be tuned to user preferences. We also provide a set of query rewriting and equivalence rules for use in query simplification and optimization. To validate our proposal, we present a prototype that allows a user to formulate RSS queries using our operators.
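The first two requirements above (semantic-aware matching and time-oriented retrieval) can be illustrated with a small sketch. It is not the algebra proposed in the paper: the names `Item`, `SYNONYMS`, `expand`, and `semantic_select` are hypothetical, and the hand-written synonym table stands in for a real lexical knowledge base.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Hypothetical toy synonym table standing in for a real knowledge base;
# the entries are illustrative only.
SYNONYMS = {
    "car": {"car", "automobile", "vehicle"},
    "storm": {"storm", "hurricane", "cyclone"},
}


@dataclass(frozen=True)
class Item:
    """A simplified RSS news item (assumed fields, not the full RSS schema)."""
    title: str
    description: str
    pub_date: datetime


def expand(term):
    """Return the term plus any neighbours listed in the toy table."""
    term = term.lower()
    return SYNONYMS.get(term, {term})


def semantic_select(items, term, since, until):
    """Keep items published in [since, until] whose text mentions the term
    or one of its listed neighbours (a stand-in for semantic matching)."""
    wanted = expand(term)
    selected = []
    for item in items:
        words = set((item.title + " " + item.description).lower().split())
        if since <= item.pub_date <= until and wanted & words:
            selected.append(item)
    return selected


now = datetime(2010, 3, 1, 12, 0)
feed = [
    Item("Hurricane nears coast", "Residents evacuate", now - timedelta(hours=2)),
    Item("Storm damage report", "Roads closed", now - timedelta(days=5)),
    Item("Automobile sales rise", "Dealers optimistic", now - timedelta(hours=1)),
]

# Items about "storm" (or a listed neighbour) from the last 24 hours.
recent_storms = semantic_select(feed, "storm", now - timedelta(days=1), now)
print([item.title for item in recent_storms])
```

In this sketch a plain keyword match on "storm" would miss the hurricane item, while the time window excludes the older storm report; both effects correspond to requirements named in the abstract, under the simplifying assumptions stated above.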
Index Terms
- Semantic aware RSS query algebra