
Filtering XML Streams by XPath and Keywords

Published: 4 December 2014

ABSTRACT

With the rise of Web search engines, keyword search over XML documents and XML streams has drawn attention from many researchers. Compared to conventional query methods, keyword search is simpler and more user-friendly for querying XML databases. A great deal of effort has therefore gone into improving the result quality of pure keyword search, where a query consists only of keywords. However, because keyword queries are inherently vague, it is hard to express the real search intention accurately with keywords alone. We observe that in many cases a combination of path-based queries and keyword search is a better choice and can meet this challenge. To address the problem, we propose a method that integrates XPath and keyword search so that users can express their search demands accurately. The experimental results show that the proposed scheme is practical for processing queries over XML streams.
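To make the idea of combining a path-based query with keywords concrete, the following is a minimal sketch, not the method proposed in the paper. It assumes a simplified path language (absolute paths made of child steps only, with no predicates, wildcards, or descendant axes) and case-insensitive substring matching for keywords. The function name `filter_stream` and the sample document are hypothetical.

```python
import io
import xml.etree.ElementTree as ET


def filter_stream(source, path, keywords):
    """Yield the text of each element located at `path` that contains every keyword.

    Assumptions (illustrative only): `path` is an absolute path of plain
    child steps such as "/dblp/article"; keywords match as case-insensitive
    substrings of the element's concatenated text.
    """
    steps = [s for s in path.split("/") if s]
    wanted = [k.lower() for k in keywords]
    stack = []  # tag names of the elements that are currently open

    # iterparse reads the input incrementally and reports start/end events.
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            stack.append(elem.tag)
            continue
        # "end" event: the element and its whole subtree have been parsed.
        if len(stack) == len(steps):
            if stack == steps:
                # Collect the subtree text and normalize whitespace.
                text = " ".join(" ".join(elem.itertext()).split())
                if all(k in text.lower() for k in wanted):
                    yield text
            # Drop the finished subtree so memory does not grow with the stream.
            elem.clear()
        stack.pop()


# Hypothetical sample input in the style of a bibliography data set.
sample = (
    b"<dblp>"
    b"<article><title>XML stream filtering</title><author>A</author></article>"
    b"<article><title>Graph mining</title></article>"
    b"<book><title>XML stream</title></book>"
    b"</dblp>"
)

for hit in filter_stream(io.BytesIO(sample), "/dblp/article", ["xml", "stream"]):
    print(hit)
```

The path restricts where a match may occur and the keywords restrict what it must contain, so the `book` element in the sample is rejected by the path even though it contains both keywords. A real system would need a full XPath evaluator over streams and a proper keyword semantics, both of which this sketch omits.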


    • Published in

      iiWAS '14: Proceedings of the 16th International Conference on Information Integration and Web-based Applications & Services
      December 2014
      587 pages

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Qualifiers

      • research-article
      • Research
      • Refereed limited
