Skip to main content
Log in

Answering ordered tree pattern queries over fuzzy XML data

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

The widespread employment of XML requires the development of efficient methods to support advanced tree pattern queries. In addition, in real-world applications that involve analysis and management of databases, uncertainty is often an inherent property of data. Although a large number of researches have been dedicated to modeling uncertain XML and answering queries over uncertain XML data, how to conduct order-based tree pattern queries on fuzzy XML data remains an open problem. In this work, we aim to fill this gap and tackle the problem of order-based queries on fuzzy XML data. Based on the node encoding, we propose efficient tree pattern matching algorithms serving for ordered queries over fuzzy XML data. In particular, our algorithms adopt a two-phase processing technique: the first phase generates the root-to-leaf path solutions, and the second phase merges these path solutions for producing the desired matches to the ordered tree pattern. Typically, our approach only needs to access data related to the query nodes in the tree pattern once to obtain all the matching answers. Finally, a comprehensive experimental evaluation is carried out to investigate the performance of the proposed approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. Since OTP could directly transform itself into previous holistic matching approaches (e.g., HolisticTwig [17], C-Twig [18], etc), we omit the comparison between our proposed algorithms and previous holistic matching approaches.

References

  1. Abiteboul S, Segoufin L, Vianu V (2001) Representing and querying XML with incomplete information. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, pp 150–161

  2. Abiteboul S, Senellart P (2006) Querying and updating probabilistic information in XML. In: Proceedings of the international conference on extending database technology, pp 1059–1068

  3. Al-Khalifa S, Jagadish H, Koudas N et al (2002) Structural joins: a primitive for efficient XML query pattern matching. In: Proceedings of the IEEE international conference on data engineering, pp 141–152

  4. Bruno N, Koudas N, Srivastava D (2002) Holistic twig joins: optimal XML pattern matching. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 310–321

  5. Cohen S, Kimelfeld B, Sagiv Y (2008) Incorporating constraints in probabilistic XML. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, pp 109–118

  6. Damiani E, Tanca L, Arcelli F (2000) Fuzzy XML queries via context-based choice of aggregations. Kybernetika 36:635–655

    MATH  Google Scholar 

  7. El-Sayed M, Dimitrova K, Rundensteiner E (2005) Efficiently supporting order in XML query processing. Data Knowl Eng 54:355–390

    Article  Google Scholar 

  8. Gaurav A, Alhajj R (2006) Incorporating fuzziness in XML and mapping fuzzy relational data into fuzzy XML. In: Proceedings of the 2006 ACM symposium on applied, computing, pp 456–460

  9. Hung E, Getoor L, Subrahmanian VS (2003) PXML: a probabilistic semistructured data model and algebra. In: Proceedings of the IEEE international conference on data, engineering, pp 467–478

  10. Jampani R, Xu F, Wu M (2008) MCDB: a Monte Carlo approach to managing uncertain data. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 687–700

  11. Jiang H, Wang W, Lu H et al (2003) Holistic twig joins on indexed XML socuments. In: Proceedings of the ACM international conference on very large data bases, pp 273–284

  12. Kimelfeld B, Kosharovsky Y, Sagiv Y (2008) Query efficiency in probabilistic XML models. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 701–714

  13. Kimelfeld B, Sagiv Y (2007) Matching twigs in probabilistic XML. In: Proceedings of the ACM international conference on very large data, bases, pp 27–38

  14. Klir G, Folder T (1982) Fuzzy sets, uncertainty and information. Prentice-Hall, Englewood Cliffs

    Google Scholar 

  15. Koyuncu M, Yazici A (2003) IFOOD: an intelligent fuzzy object-oriented database architecture. IEEE Trans Knowl Eng 15(5):1137–1154

    Article  Google Scholar 

  16. Lee J, Fanjiang Y, Kuo J et al (2002) Modeling imprecise requirements with XML. In: Proceedings of IEEE international fuzzy systems conference proceedings, pp 861–866

  17. Li Y, Wang G, Xin J et al (2009) Holistically twig matching in probabilistic XML. In : Proceedings of the IEEE international conference on data engineering, pp 1649–1656

  18. Liu J, Ma ZM, Yan L (2009) Efficient processing of twig pattern matching in fuzzy XML. In: Proceedings of the ACM international conference on information and knowledge management, pp 117–126

  19. Liu J, Ma ZM, Yan L (2013) Querying and ranking incomplete twigs in probabilistic XML. World Wide Web 16(3):325–353

    Article  MathSciNet  Google Scholar 

  20. Lu J, Ling T W, Chan C et al (2005) From region encoding to extended dewey: on efficient processing of XML twig pattern matching. In: Proceedings of the ACM international conference on very large data bases, pp 193–204

  21. Lu J, Ling T W, Yu T et al (2005) Efficient processing of ordered XML twig pattern. In: Proceedings of the international conference on database and expert systems applications, pp 300–309

  22. Ma ZM, Liu J, Yan L (2011) Matching twigs in fuzzy XML. Inf Sci 181(1):184–200

    Article  MATH  MathSciNet  Google Scholar 

  23. Moro M, Vagena Z, Tsotras V (2005) Tree-pattern queries on a lightweight XML processor. In: Proceedings of the ACM international conference on very large sata, bases, pp 205–216

  24. Nierrman A, Jagadish HV (2002) ProTDB: probabilistic data in XML. In: Proceedings of the ACM international conference on very large data, bases, pp 646–657

  25. Pei J, Jiang B, Lin X et al (2007) Probabilistic skylines on uncertain data. In: Proceedings of the ACM international conference on very large data bases, pp 15–26

  26. Raju K, Majumdar A (1988) Fuzzy functional dependencies and lossless join decomposition of fuzzy relational database systems. ACM Trans Database Syst 13(2):129–166

    Article  Google Scholar 

  27. Senellart P, Abiteboul S (2007) On the complexity of managing probabilistic XML data. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, pp 283–292

  28. Tatarinov I, Viglas S, Beyer K et al (2002) Storing and querying ordered XML using a relational database system. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 204–215

  29. Turowskia K, Weng U (2002) Representing and processing fuzzy information-an XML-based approach. Knowl Based Syst 15(1):67–75

    Article  Google Scholar 

  30. University of Washington XML Repository. http://www.cs.washington.edu/research/xmldatasets/

  31. XMARK the XML-benchmark project. http://monetdb.cwi.nl/xml/index.html

  32. Yager R (2000) Targeted E-commerce marketing using fuzzy intelligent agents. IEEE Intell Syst 15(6): 42–45

    Google Scholar 

  33. Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353

    Article  MATH  MathSciNet  Google Scholar 

  34. Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (60873010) and the Fundamental Research Funds for the Central Universities (N090504005 and N100604017). The authors would also like to express their gratitude to the anonymous reviewers for providing very helpful suggestions. Many thanks are due to Mr. Qv for helpful comments and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, J., Ma, Z.M. & Feng, X. Answering ordered tree pattern queries over fuzzy XML data. Knowl Inf Syst 43, 473–495 (2015). https://doi.org/10.1007/s10115-014-0731-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-014-0731-5

Keywords

Navigation