ABSTRACT
With the rise of Web search engines, processing keyword search over XML and XML streams has drawn much attention from many researchers. Compared to conventional query methods, keyword search has several benefits for its simplicity and its user-friendliness in querying XML databases. Therefore, a great deal of effort has been put on this search paradigm by trying to improve the quality of search result of pure keyword search, where only keywords are allowed as a query. However, due to the vagueness of keyword search, it is hard to accurately express real search intention with just keyword search. We observe that there are many cases where the combination of path-based query and keyword search is a better choice and can deal with such challenge. To address this problem, we propose a method to integrate XPath and keyword search so that users can accurately express their search demands. The experimental results show that the proposed scheme can process queries over XML streams practically.
- Apache xml project. xerces java parser 1.2.3 release. http://xml.apache.org/xerces-j/index.html, 1999.Google Scholar
- XPath full text 1.0. www.w3.org/TR/xpath-full-text-10/, 2011.Google Scholar
- Computer science bibliography. www.cs.washington.edu/research/xmldatasets/www/repository.html/dblp, 2013.Google Scholar
- Extensible markup language. www.w3.org/XML/, 2013.Google Scholar
- World geographic database. www.cs.washington.edu/research/xmldatasets/www/repository.html/mondial, 2013.Google Scholar
- XPath expression. www.w3schools.com/xpath/, 2014.Google Scholar
- R. Busse, M. Carey, D. Florescu, M. Kersten, I. Manolescu, A. Schmidt, and F. Wass. XMark-an XML benchmark project. 2013.Google Scholar
- S. Cohen, J. Mamou, Y. Kanza, and Y. Sagiv. XSEarch: a semantic search engine for XML. In Proc. Of VLDB, volume 29, pages 45--56, 2003. Google ScholarDigital Library
- Y. Diao and M. J. Franklin. High-performance XML filtering: An overview of YFilter. In IEEE, pages 41--48, 2003.Google Scholar
- M. Gawinecki, F. Mandreoli, and G. Cabri. Keyword search over XML streams: Addressing time-stamping and understanding results. pages 371--382, U. of Modena, 2008.Google Scholar
- V. Hristidis and N. Koudas. Keyword proximity search in XML trees. IEEE Trans., 18:525--539, 2006. Google ScholarDigital Library
- C. Hummel, S. da Silva, M. Moro, and Laender H.F. Multiple keyword-based queries over XML streams. In ACM, pages 1577--1582, ACM New York, NY, USA, 2011. Google ScholarDigital Library
- A. Gupta J. Green and M. Onozuka. Processing XML stream with deterministic automata and stream indexes. ACM, 29:752--788, 2004. Google ScholarDigital Library
- Jaehoon K., Youngsoo K., and Seog P. Posfilter: An efficient filtering technique of xml documents based on postfix sharing.Google Scholar
- Candan K.S., Hsiung W., Chen S., Tatemura J., and D. Agrawal. Afilter: Adaptable xml filtering with prefix-caching and suffix-clustering.Google Scholar
- F. Shao L. Guo, C. Botev, and J. Shanmugasundaram. XRANK: ranked keyword search over XML documents. In Proc. Of SIGMOD Conf., pages 16--27, ACM New York, NY, USA, 2003. Google ScholarDigital Library
- Y. Li, C. Yu, and H. V. Jagadish. Schema-free XQuery. In VLDB, volume 30, pages 72--83, 2004. Google ScholarDigital Library
- Z. Vagena,, and M. Moro. Semantic search over XML document streams. In DATAX, 2008.Google Scholar
- Z. Vagena, L. S. Colby, F. Ozcan, A. Balmin, and Q. Li. On the effectiveness of flexible querying heuristics for XML data. In Proc. of XSym, volume 4704, pages 77--91, 2007. Google ScholarDigital Library
- Y. Xu and Y. Papakonstantinou. Efficient keyword search for smallest LCAs in XML databases. In Proc. of SIGMOD Conf., pages 527--538, ACM New York, NY, USA, 2005. Google ScholarDigital Library
- C. Yu and L. Popa. Constraint-based XML query rewriting for data integration. In Proc. of SIGMOD, pages 371--382, ACM New York, NY, USA, 2004. Google ScholarDigital Library
Index Terms
- Filtering XML Streams by XPath and Keywords
Recommendations
Keyword Search with Path-Based Filtering over XML Streams
SRDS '14: Proceedings of the 2014 IEEE 33rd International Symposium on Reliable Distributed SystemsRecently, a great deal of attention has been focusing on processing keyword search over static and XML streams. Keyword search is becoming more popular for its simplicity and its user-friendliness in querying XML databases. However, it is hard to ...
Efficient Top-k Keyword Search on XML Streams
ICYCS '08: Proceedings of the 2008 The 9th International Conference for Young Computer ScientistsKeywords can be used to query XML data without schema information. In this paper, a novel kind of query is proposed, top-k keyword search over XML streams. According to the set of keywords and the number of results, such query can retrieve the top-k XML ...
Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams
DASFAA '09: Proceedings of the 14th International Conference on Database Systems for Advanced ApplicationsKeywords are suitable for query XML streams without schema information. In current forms of keywords search on XML streams, rank functions do not always represent users' intentions. This paper addresses this problem in another aspect. In this paper, the ...
Comments