Abstract
Resource description framework (RDF) stream is useful to model spatio-temporal data. In this paper, we propose a framework for large-scale RDF stream processing, LRSP, to process general continuous queries over large-scale RDF streams. Firstly, we propose a formalization (named CT-SPARQL) to represent the general continuous queries in a unified, unambiguous way. Secondly, based on our formalization we propose LRSP to process continuous queries in a common white-box way by separating RDF stream processing, query parsing, and query execution. Finally, we implement and evaluate LRSP with those popular continuous query engines on some benchmark datasets and real-world datasets. Due to the architecture of LRSP, many efficient query engines (including centralized and distributed engines) for RDF can be directly employed to process continuous queries. The experimental results show that LRSP has a higher performance, specially, in processing large-scale real-world data.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Barbieri D F, Braga D, Ceri S, Valle E D, Grossniklaus M. Querying RDF streams with C-SPARQL. ACM SIGMOD Record, 2010, 39(1): 20-26.
Le-Phuoc D, Dao-Tran M, Parreira J X, Hauswirth M. A native and adaptive approach for unified processing of linked streams and linked data. In Proc. the 10th Int. Semantic Web Conference, October 2011, pp.370-388.
Anicic D, Fodor P, Rudolph S, Stojanovic N. EP-SPARQL: A unified language for event processing and stream reasoning. In Proc. the 20th Int. Conference on World Wide Web, March 2011, pp.635-644.
Zou L, Özsu M T, Chen L, Shen X, Huang R, Zhao D. gStore: A graph-based SPARQL query engine. The VLDB Journal, 2014, 23(4): 565-590.
Neumann T, Weikum G. The RDF-3X engine for scalable management of RDF data. The VLDB Journal, 2010, 19(1): 91-113.
Peng P, Zou L, Özsu M T, Chen L, Zhao D. Processing SPARQL queries over distributed RDF graphs. The VLDB Journal, 2016, 25(2): 243-268.
Gurajada S, Seufert S, Miliaraki I, Theobald M. TriAD: A distributed shared-nothing RDF engine based on asynchronous message passing. In Proc. the 2014 ACM SIGMOD Int. Conference on Management of Data, June 2014, pp.289-300.
Li Q, Zhang X, Feng Z. PRSP: A plugin-based framework for RDF stream processing. In Proc. the 26th Int. Conference on World Wide Web Companion, April 2017, pp.815-816.
Alessandro M, Gianpaolo C. Processing flows of information: From data stream to complex event processing. In Proc. the 5th ACM Int. Conference on Distributed Event-Based Systems, July 2011, pp.359-360.
Kolchin M, Wetz P, Kiesling E, Tjoa A M. YABench: A comprehensive framework for RDF stream processor correctness and performance assessment. In Proc. the 16th International Conference on Web Engineering, June 2016, pp.280-298.
Arasu A, Babu S, Widom J. The CQL continuous query language: Semantic foundations and query execution. The VLDB Journal, 2006, 15(2): 121-142.
Carroll J J, Dickinson I, Dollin C, Reynolds D, Seaborne A, Wilkinson K. Jena: Implementing the semantic web recommendations. In Proc. the 13th Int. Conference on World Wide Web-Alternate Track Papers & Posters, May 2004, pp.74-83.
Ren X, Curé O. Strider: A hybrid adaptive distributed RDF stream processing engine. In Proc. the 16th Int. Semantic Web Conference, October 2017, pp.559-576.
Dell’Aglio D, Valle E D, Calbimonte J P, Corcho Ó. RSPQL semantics: A unifying query model to explain heterogeneity of RDF stream processing systems. Int. Journal on Semantic Web and Information Systems, 2014, 10(4): 17-44.
Dell’Aglio D, Calbimonte J P, Valle E D, Corcho Ó. Towards a unified language for RDF stream query processing. In Proc. the 12th European Semantic Web Conference, May 2015, pp.353-363.
Brandt S, Kalayci E G, Ryzhikov V, Xiao G, Zakharyaschev M. Querying log data with temporal logic. Journal of Artificial Intelligence Research, 2018, 62: 829-877.
Li L, Kim J, Xu J, Zhou X. Time-dependent route scheduling on road networks. ACM SIGSPATIAL Special, 2018, 10(1): 10-14.
Qian Z, Xu J, Zheng K, Zhao P, Zhou X. Semantic-aware top-k spatial keyword queries. World Wide Web: Internet and Web Information Systems, 2018, 21(3): 573-594.
Lanti D, Xiao G, Calvanese D. VIG: Data scaling for OBDA benchmarks. Semantic Web, 2019, 10(2): 413-433.
Zhao B. Research on adaptive RDF stream processing architecture [Master Thesis]. College of Intelligence and Computing, Tianjin University, 2018. (in Chinese)
Li J, Liu C, Yu J X, Chen Y, Sellis T, Culpepper J S. Personalized influential topic search via social network summarization. IEEE Transactions on Konwledge and Data Engineering, 2016, 28(7): 1820-1834.
Li J, Sellis T, Culpepper J S, He Z, Liu C, Wang J. Geosocial influence spanning maximization. IEEE Transactions on Knowledge and Data Engineering, 2017, 29(8): 1653-1666.
Li J, Wang X, Deng K, Yang X, Sellis T, Yu J X. Most influential community search over large social networks. In Proc. the 33rd IEEE Int. Conference on Data Engineering, April 2017, pp.871-882.
Li J, Liu C, Islam M S. Keyword-based correlated network computation over large social media. In Proc. the 30th IEEE Int. Conference on Data Engineering, March 2014, pp.268-279.
Li J, Cai T, Mian A, Li R, Sellis T, Yu J X. Holistic influence maximization for targeted advertisements in spatial social networks. In Proc. the 34th IEEE Int. Conference on Data Engineering, April 2018, pp.1340-1343.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
ESM 1
(PDF 553 kb)
Rights and permissions
About this article
Cite this article
Fang, H., Zhao, B., Zhang, XW. et al. A United Framework for Large-Scale Resource Description Framework Stream Processing. J. Comput. Sci. Technol. 34, 762–774 (2019). https://doi.org/10.1007/s11390-019-1941-9
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-019-1941-9