Skip to main content

Discovering Persistent Subgraph Patterns over Streaming Graphs

  • Conference paper
  • First Online:
Database Systems for Advanced Applications (DASFAA 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13945))

Included in the following conference series:

  • 1474 Accesses

Abstract

Streaming graph analysis is gaining importance in various fields due to the natural dynamicity in many real graph applications. Prior subgraph discovery problem over streaming graphs mostly focuses on characteristics like frequency and burstiness. Persistence, as a new characteristic, is getting increasing attention. Persistent subgraph discovery highlights behaviors where a subgraph appears recurrently in many time windows, which is vital for many real-world applications (e.g., anomaly detection). While persistent subgraph discovery enjoys many interesting real-life applications, there is no off-the-shelf solution to compute the persistent pattern efficiently. In this work, we are the first to study the persistent subgraph pattern discovering problem over the streaming graph. We devise an auxiliary data structure called \(\textsf {TFD} \) to detect the persistent subgraph patterns in real-time with limited memory usage. \(\textsf {TFD} \) maps each subgraph into the corresponding bucket based on hash functions to compute the persistence of each pattern. Then we introduce optimizations to separate persistent and non-persistent patterns, further improving the effectiveness and throughput in space-scarce scenarios. Extensive experiments confirm the superiority of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.verizon.com/business/resources/reports/dbir/.

  2. 2.

    http://konect.uni-koblenz.de/networks/.

  3. 3.

    https://offshoreleaks.icij.org/pages/database.

  4. 4.

    http://socialnetworks.mpi-sws.org.

References

  1. Abdelhamid, E., Canim, M., Sadoghi, M., Bhattacharjee, B., Chang, Y., Kalnis, P.: Incremental frequent subgraph mining on large evolving graphs. In: ICDE 2018, pp. 1767–1768 (2018)

    Google Scholar 

  2. Aslay, Ç., Nasir, M.A.U., Morales, G.D.F., Gionis, A.: Mining frequent patterns in evolving graphs. In: CIKM 2018. pp. 923–932 (2018)

    Google Scholar 

  3. Belth, C., Zheng, X., Koutra, D.: Mining persistent activity in continually evolving networks. In: KDD 2020, pp. 934–944 (2020)

    Google Scholar 

  4. Chen, Z., Wang, X., Wang, C., Li, J.: Explainable link prediction in knowledge hypergraphs. In: CIKM 2022, pp. 262–271 (2022)

    Google Scholar 

  5. Choudhury, S., Holder, L.B., Jr., G.C., Agarwal, K., Feo, J.: A selectivity based approach to continuous pattern detection in streaming graphs. In: EDBT 2015, pp. 157–168 (2015)

    Google Scholar 

  6. Dai, H., Shahzad, M., Liu, A.X., Zhong, Y.: Finding persistent items in data streams. Proc. VLDB Endow. 10(4), 289–300 (2016)

    Article  Google Scholar 

  7. Hellmann, S., Stadler, C., Lehmann, J., Auer, S.: DBpedia live extraction. In: Meersman, R., Dillon, T., Herrero, P. (eds.) OTM 2009. LNCS, vol. 5871, pp. 1209–1223. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-05151-7_33

    Chapter  Google Scholar 

  8. Li, R., Su, J., Qin, L., Yu, J.X., Dai, Q.: Persistent community search in temporal networks. In: ICDE 2018, pp. 797–808 (2018)

    Google Scholar 

  9. Li, Y., Zou, L., Özsu, M.T., Zhao, D.: Time constrained continuous subgraph search over streaming graphs. In: ICDE 2019, pp. 1082–1093 (2019)

    Google Scholar 

  10. Li, Z., Liu, X., Wang, X., Liu, P., Shen, Y.: Transo: a knowledge-driven representation learning method with ontology information constraints. World Wide Web 26, 297–319 (2023)

    Article  Google Scholar 

  11. Ma, Z., Yang, J., Li, K., Liu, Y., Zhou, X., Hu, Y.: A parameter-free approach for lossless streaming graph summarization. In: Jensen, C.S., et al. (eds.) DASFAA 2021. LNCS, vol. 12681, pp. 385–393. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73194-6_26

    Chapter  Google Scholar 

  12. Min, S., Park, S.G., Park, K., Giammarresi, D., Italiano, G.F., Han, W.: Symmetric continuous subgraph matching with bidirectional dynamic programming. Proc. VLDB Endow. 14(8), 1298–1310 (2021)

    Article  Google Scholar 

  13. Nasir, M.A.U., Aslay, Ç., Morales, G.D.F., Riondato, M.: Tiptap: approximate mining of frequent k-subgraph patterns in evolving graphs. ACM Trans. Knowl. Discov. Data 15(3), 48:1-48:35 (2021)

    Article  Google Scholar 

  14. Pacaci, A., Bonifati, A., Özsu, M.T.: Regular path query evaluation on streaming graphs. In: SIGMOD Conference 2020, pp. 1415–1430 (2020)

    Google Scholar 

  15. Ray, A., Holder, L., Choudhury, S.: Frequent subgraph discovery in large attributed streaming graphs. In: Proceedings of the 3rd International Workshop on Big Data, vol. 36, pp. 166–181 (2014)

    Google Scholar 

  16. Yang, X., Song, C., Yu, M., Gu, J., Liu, M.: Distributed triangle approximately counting algorithms in simple graph stream. ACM Trans. Knowl. Discov. Data 16(4), 79:1–79:43 (2022)

    Google Scholar 

  17. Zhang, L., Jiang, H., Wang, F., Feng, D., Xie, Y.: T-sample: a dual reservoir-based sampling method for characterizing large graph streams. In: ICDE 2019, pp. 1674–1677 (2019)

    Google Scholar 

  18. Zhang, Q., Guo, D., Zhao, X.: Discovering bursting patterns over streaming graphs. In: DASFAA 2022, pp. 441–458 (2022)

    Google Scholar 

  19. Zhang, Y., et al.: On-off sketch: a fast and accurate sketch on persistence. Proc. VLDB Endow. 14(2), 128–140 (2020)

    Article  Google Scholar 

Download references

Acknowledgements

This work is partially supported by National Natural Science Foundation of China under Grant No. U19B2024, National Natural Science Foundation of China under Grant No.6227246 and Postgraduate Scientific Research Innovation Project of Hunan Province under Grant No. CX20210038.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Qianzhen Zhang or Deke Guo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Huang, C., Zhang, Q., Guo, D., Zhao, X. (2023). Discovering Persistent Subgraph Patterns over Streaming Graphs. In: Wang, X., et al. Database Systems for Advanced Applications. DASFAA 2023. Lecture Notes in Computer Science, vol 13945. Springer, Cham. https://doi.org/10.1007/978-3-031-30675-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-30675-4_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-30674-7

  • Online ISBN: 978-3-031-30675-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics