Skip to main content
Log in

Queries with aggregate functions over fuzzy RDF data

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Since the World Wide Web Consortium (W3C) recommended the Resource Description Framework (RDF) data model, it has been widely adopted and applied in the era of Big Data and knowledge engineering. Now, large volumes of RDF data are produced and made available regularly. In real-world applications, information is often vague or ambiguous. Therefore, data may contain imperfect information, particularly in the context of open information extraction from the Web. The fuzzy RDF model has been proposed to handle uncertain semantics in RDF data. Aggregate functions are a group of functions for statistics and group data calculation, which are essential for the analysis and processing of RDF data. In spite of the increased attention given to aggregate queries over RDF data, little has been done on aggregate queries over fuzzy RDF data. This paper proposes a more scalable and flexible aggregate query method for fuzzy RDF data. By introducing the concepts of upper and lower bounds and expectation in probability theory, we identify three variants of the aggregate functions in RDF query language over the fuzzy RDF model. The language has been implemented by transformation based on standard SPARQL language and evaluated with experiments. In addition, we examine how to handle multiple types of aggregate operations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Available on request.

References

  1. Klyne G, Carroll JJ Resource description framework (RDF): Concepts and Abstract Syntax. http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/

  2. Bornea MA et al. (2013) Building an efficient RDF store over a relational database. In: Proceedings of the 2013 ACM International Conference on Management of Data, 121–132

  3. Ji S et al (2020) A survey on knowledge graphs: representation, acquisition and applications. IEEE Trans Neural Netw Learn Syst 33(2):494–514

    Article  MathSciNet  Google Scholar 

  4. Manolis N, Tzitzikas Y (2011) Interactive exploration of fuzzy RDF knowledge bases. In: Proceedings of the 8th Extended Semantic Web Conference. 1–16

  5. Raju KVSVN, Majumdar AK (1988) Fuzzy functional dependencies and lossless join decomposition of fuzzy relational database systems. ACM Trans Database Syst 13(2):129–166

    Article  Google Scholar 

  6. Majumdar AK, Bhattacharya I, Saha AK (2002) An object-oriented fuzzy data model for similarity detection in image databases. IEEE Trans Knowl Data Eng 14(5):1186–1189

    Article  Google Scholar 

  7. Panic G, Rackovic M, Skrbic S (2014) Fuzzy XML and prioritized fuzzy XQuery with implementation. J Intell Fuzzy Syst 26(1):303–316

    Article  Google Scholar 

  8. Lisi FA, Straccia U (2015) Learning in description logics with fuzzy concrete domains. Fund Inform 140(3–4):373–391

    MathSciNet  MATH  Google Scholar 

  9. Morente-Molinera JA, Pérez IJ, Ureña MR, Herrera-Viedma E (2015) Building and managing fuzzy ontologies with heterogeneous linguistic information. Knowl-Based Syst 88:154–164

    Article  Google Scholar 

  10. Ma ZM, Li Y (2008) A literature overview of fuzzy database models. J Inf Sci Eng 24(1):189–202

    Google Scholar 

  11. Ma ZM, Li Y (2010) A literature overview of fuzzy conceptual data modeling. J Inf Sci Eng 26(2):427–441

    Google Scholar 

  12. Ma Z, Yan Li (2016) Modeling fuzzy data with XML: a survey. Fuzzy Sets Syst 301:146–159

    Article  MathSciNet  Google Scholar 

  13. Mazzieri M, Dragoni AF (2008) A fuzzy semantics for the resource description framework. URSW 2005–2007:244–261

    Google Scholar 

  14. Lv YH, Ma ZM, Yan L (2008) Fuzzy RDF: a data model to represent fuzzy metadata. In: Proceedings of the 2008 IEEE International Conference on Fuzzy Systems. Hong Kong, China, 1439–1445

  15. Ma Z, Li G, Yan Li (2018) Fuzzy data modeling and algebraic operations in RDF. Fuzzy Sets Syst 351:41–63

    Article  MathSciNet  MATH  Google Scholar 

  16. Vaneková V, Bella J, Gurský P, Horváth T (2005) Fuzzy RDF in the semantic web: deduction and induction. In: Proceedings of the 6th Workshop on Data Analysis, Abaujszanto, 16–29

  17. Straccia U (2009) A minimal deductive system for general fuzzy RDF. In: Proceedings of the 3th International Conference on Web Reasoning and Rule Systems, Chantilly, USA, 166–181

  18. Liu C, Qi GL, Wang HF, Yu Y (2011) Fuzzy reasoning over RDF data using OWL vocabulary. In: Proceedings of the 2011 IEEE/WIC/ACM International Conference on Web Intelligences, 162–169

  19. Buche P, Dibie-Barthélemy J, Hignette G (2008) Flexible querying of fuzzy RDF annotations using fuzzy conceptual graphs. In: Proceedings of the 16th International Conference on Conceptual Structures, 133–146

  20. Bahri A, Bouaziz R, Gargouri F (2010) Querying fuzzy RDFS semantic annotations. In: Proceedings of the 2010 IEEE International Conference on Fuzzy Systems, 1–8

  21. Pivert O, Slama O, Thion V (2016) An extension of SPARQL with fuzzy navigational capabilities for querying fuzzy RDF data. In: Proceedings of the 2016 IEEE International Conference on Fuzzy Systems, 2409–2416

  22. Pivert O, Slama O, Thion V (2017) Fuzzy quantified queries to fuzzy RDF databases. In: Proceedings of the 2017 IEEE International Conference on Fuzzy Systems, 2017

  23. Slama O(2019) Personalized queries under a generalized user profile model based on fuzzy SPARQL preferences. In: 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2019: 1–6

  24. Li G, Yan Li, Ma Z (2019) Pattern match query over fuzzy RDF graph. Knowl-Based Syst 165:460–473

    Article  Google Scholar 

  25. Li G, Yan Li, Ma Z (2019) A method for fuzzy quantified querying over fuzzy resource description framework graph. Int J Intell Syst 34(6):1086–1107

    Article  Google Scholar 

  26. Li G, Yan Li, Ma Z (2019) An approach for approximate subgraph matching in fuzzy RDF graph. Fuzzy Sets Syst 376:106–126

    Article  MathSciNet  MATH  Google Scholar 

  27. Li G, Ma Z (2018) An efficient matching algorithm for fuzzy RDF graph. J Inf Sci Eng 34(2):519–534

    MathSciNet  Google Scholar 

  28. Ma Z, Yan Li (2018) Modeling fuzzy data with RDF and fuzzy relational database models. Int J Intell Syst 33(7):1534–1554

    Article  Google Scholar 

  29. Fan T, Yan Li, Ma Z (2019) Mapping fuzzy RDF(S) into fuzzy object-oriented databases. Int J Intell Syst 34(10):2607–2632

    Article  Google Scholar 

  30. Fan T, Yan Li, Ma Z (2020) Storing and querying fuzzy RDF(S) in HBase databases. Int J Intell Syst 35(4):751–780

    Article  Google Scholar 

  31. Jugel U, Jerzak Z, Hackenbroich G, Markl V (2016) VDDA: automatic visualization-driven data aggregation in relational databases. VLDB J 25(1):53–77

    Article  Google Scholar 

  32. Chan CY et al. (2002) Tree pattern aggregation for scalable XML data dissemination. In: Proceedings of 28th International Conference on Very Large Data Bases, 826–837

  33. Wu H, Ling TW, Xu L, Bao Z (2009) Performing grouping and aggregate functions in XML queries. In: Proceedings of the 18th International Conference on World Wide Web, 1001–1010

  34. Kaminski M, Kostylev EV, Grau BC (2017) Query nesting, assignment, and aggregation in SPARQL 1.1. ACM Trans Database Syst 42(3):1–46

    Article  MathSciNet  Google Scholar 

  35. McClean SI, Scotney BW, Shapcott M (2001) Aggregation of imprecise and uncertain information in databases. IEEE Trans Knowl Data Eng 13(6):902–912

    Article  MATH  Google Scholar 

  36. Scotney BW, McClean SI (2003) Database aggregation of imprecise and uncertain evidence. Inf Sci 155(3–4):245–263

    Article  Google Scholar 

  37. Murthy R, Ikeda R, Widom J (2011) Making aggregation work in uncertain and probabilistic databases. IEEE Trans Knowl Data Eng 23(8):1261–1273

    Article  Google Scholar 

  38. Pivert O, Prade H (2014) Dealing with aggregate queries in an uncertain database model based on possibilistic certainty. IPMU 3(2014):150–159

    MathSciNet  Google Scholar 

  39. Nermin Abdel-Hakim Othman, Eldin AS, El Zanfaly DS (2015) Enhancing aggregation over uncertain databases. In: 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing, 132–139

  40. Damiani E, Tanca L, Fontana FA (2000) Fuzzy XML queries via context-based choice of aggregations. Kybernetika 36(6):635–655

    MATH  Google Scholar 

  41. Abiteboul S et al. (2010) Aggregate queries for discrete and continuous probabilistic XML. In: Proceedings of the 13th International Conference on Database Theory, 50–61

  42. Abiteboul S et al (2011) Capturing continuous data and answering aggregate queries in probabilistic XML. ACM Trans Database Syst 36(4):1–45

    Article  Google Scholar 

  43. Almendros-Jiménez JM, Becerra-Terón A, Moreno G (2017) A fuzzy extension of SPARQL based on fuzzy sets and aggregators. In: Proceedings of the 2017 IEEE International Conference on Fuzzy Systems, pp. 1–6

  44. Beckett D, RDF/XML Syntax Specification, http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/

  45. Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353

    Article  MATH  Google Scholar 

  46. Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28

    Article  MathSciNet  MATH  Google Scholar 

  47. Siddique N, Adeli H (2013) Computational intelligence synergies of fuzzy logic neural networks and evolutionary computing. Wiley, West Sussex

    Book  Google Scholar 

  48. Sheldon R (2002) A first course in probability. Pearson Education India, New Delhi

    MATH  Google Scholar 

Download references

Acknowledgements

The work is supported in part by the National Science Foundation of China (62176121 and 61772269).

Funding

The work is supported in part by the National Science Foundation of China (62176121 and 61772269).

Author information

Authors and Affiliations

Authors

Contributions

ZM wrote the main manuscript text; XZ and YZ prepared experiments. All authors reviewed the manuscript.

Corresponding author

Correspondence to Zongmin Ma.

Ethics declarations

Conflict of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Consent for publication

All authors reviewed the manuscript and consent for publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ma, Z., Zhang, X. & Zhao, Y. Queries with aggregate functions over fuzzy RDF data. J Supercomput 79, 14780–14807 (2023). https://doi.org/10.1007/s11227-023-05235-x

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-023-05235-x

Keywords

Navigation