skip to main content
10.1145/1967486.1967533acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Semantic aware RSS query algebra

Published:08 November 2010Publication History

ABSTRACT

Existing XML query algebras are not fully appropriate to retrieve RSS news items mainly due to three reasons: 1) RSS is text rich and its content is dependent on the wording and verbification of the author, thus semantic aware operators are needed; 2) news items are dynamic and consequently time oriented retrieval is needed; 3) a news item may evolve through time, or overlap with other news items and hence identifying relationships between items is also needed. In this paper, we aim to solve these issues by providing a dedicated RSS algebra based on semantic-aware operators that consider RSS characteristics. The provided operators are application domain specific and can be tuned according to the user preferences. We also provide a set of query rewriting and equivalence rules that would be used during query simplification and optimization. In addition and in order to validate our proposal, we present here our prototype that allows a user to formulate RSS query using our operators.

References

  1. Babcock, B., Datar, M., and Motwani, R. Sampling from a moving window over streaming data. In 13th annual ACM-SIAM SODA (2002), 633--634. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Catania, B., Ferrari, E., Levy, A. Y., and Mendelzon, A. O. XML and Object Technology. In 1st ECOOP Workshops (2000), 191--202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chen, Z., Jagadish, H. V., Lakshmanan, L. V. S., and Paparizos, S. From Tree Patterns to Generalized Tree Patterns: On Efficient Evaluation of XQuery. In 29th Inter. Conf. on VLDB (2003), 237--248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Clark, J. and DeRose, S. XML Path Language (XPath) Version 1.0. W3C Recommendation. W3C. 1999.Google ScholarGoogle Scholar
  5. Cohen, S., Mamou, J., Kanza, Y., and Sagiv, Y. XSEarch: A Semantic Search Engine for XML. In 29th Inter. Conf. on VLDB (2003), 45--56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Di Lorenzo, G., Hacid, H., Paik, H., and Benatallah, B. Data integration in mashups. In SIGMOD Rec. (2009), 59--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Fisher, D., Lam, F., and Wong, R. K. Algebraic Transformation and Optimization for XQuery. In 6th Asia-Pacific Web Conf. (2004), 201--210.Google ScholarGoogle Scholar
  8. Frasincar, F., Houben, G., and Pau, C. XAL: an algebra for XML query optimization. In 13th Australasian Database Conf. (2002), 49--56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Getahun, F., Tekli, J., Atnafu, S., and Chbeir, R. Towards Efficient Horizontal Multimedia Database Fragmentation using Semantic-based Predicates Implication. (2007), SBBD.Google ScholarGoogle Scholar
  10. Getahun, F., Tekli, J., Chbeir, R., Viviani, M., and Yetongnon, K. Semantic-based Merging of RSS Items. World Wide Web, 13, 1--2 (2009), 169--207. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Guo, L., Shao, F., Botev, C., and Shanmugasundaram, J. XRANK: Ranked Keyword Search over XML Documents. In SIGMOD Inter. Conf. on Management of Data (2003), 16--27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Hammersley, B. Content Syndication with RSS. O'Reilly & Associates Publishers, San Francisco, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hung, E., Deng, Y., and Subrahmanian, V. S. TOSS: an extension of TAX with ontologies and similarity queries. In SIGMOD '04 (Paris, France 2004), 719--730. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jagadish, H. V., Lakshmanan, L. V., Srivastava, D., Thompson, K., and Srivastava., D. TAX: A Tree Algebra for XML. In 8th Inter. Workshop on DB Programming Lang. (2001), 149--164. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Kanne, C. and Moerkotte, G. Efficient storage of XML data. In 16th Inter. Conf. on Data Engineering (2000), 198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Liu, H., Ramasubramanian, V., and Sirer, E. G. Client Behavior and Feed Characteristics of RSS, a Publish-Subscribe System for Web Micronews. In the 5th ACM SIGCOMM Conf. on Internet Measurement (2005), 3--3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. McGill, M. J. Introduction to Modern Information Retrieval. McGraw-Hil, New York, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Naughton, J. F., DeWitt, D. J, Maier, D., Aboulnaga, A., Chen, J., and et, al. The Niagara Internet query system. In IEEE Data Eng. Bulletin (2001), 27--33.Google ScholarGoogle Scholar
  19. Paparizos, S., Wu, Y., Lakshmanan, L. V., and Jagadish, H. V. Tree Logical Classes for Efficient Evaluation of XQuery. Conference. In Inter. Conf. on Management of Data (2004), ACM SIGMOD, 71--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Richardson, R and Smeaton., A. F. Using wordnet in a knowledge-based approach to information retrieval. Dublin, Ireland, 1995.Google ScholarGoogle Scholar
  21. Robie, J., Chamberlin, D., Dyck, M., and Snelson, J. XQuery 1.1: An XML Query Language W3C Working Draft. 2009.Google ScholarGoogle Scholar
  22. Rundensteiner, X. and Zhang, E. XAT: XML Algebra for the Rainbow System. WPI, 2002.Google ScholarGoogle Scholar
  23. Sartiani, C. and Albano, A. Yet Another Query Algebra For XML Data. In IDEAS (2002), IEEE Comp Society, 106--115. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D. J., and Naughton, J. F. Relational Databases for Querying XML Documents: Limitations and Opportunities. In 25th inter. Conf. on VLDB (1999), 302--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Shimura, T., Yoshikawa, M., and Uemura, S. Storage and Retrieval of XML Documents using Object-Relational Databases. In 10th inter. Conf. on DEXA (1999), 206--217. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Theobald, M., Schenkel, R., and Weikum, G. An Efficient and Versatile Query Engine for TopX Search. In 31st Inter. Conf. on VLDB (2005), 625--636. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. WordNet 2.1. A Lexical Database of the English Lang. 2005.Google ScholarGoogle Scholar

Index Terms

  1. Semantic aware RSS query algebra
                  Index terms have been assigned to the content through auto-classification.

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in
                  • Published in

                    cover image ACM Other conferences
                    iiWAS '10: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
                    November 2010
                    895 pages
                    ISBN:9781450304214
                    DOI:10.1145/1967486

                    Copyright © 2010 ACM

                    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                    Publisher

                    Association for Computing Machinery

                    New York, NY, United States

                    Publication History

                    • Published: 8 November 2010

                    Permissions

                    Request permissions about this article.

                    Request Permissions

                    Check for updates

                    Qualifiers

                    • research-article

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader