Abstract
Subgraph matching is to enumerate all the subgraphs of a graph that is isomorphic to the query graph. It is a critical component of many applications such as clustering coefficient computation and trend evolution. As the real-world graph grows explosively, we have massive graphs that are much larger than the memory size of the modern machines. Therefore, in this paper, we study the subgraph matching problem where the graph is stored on disk. Different from the existing approaches, we design a block-based approach, \(\mathsf {GScan}\), which investigates the schedule of the blocks transferred between the memory and the disk. To achieve high I/O efficiency, \(\mathsf {GScan}\) only uses sequential I/O read operations. We conduct experimental studies to demonstrate the efficiency of our block-based approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kim, H., et al.: Dualsim: parallel subgraph enumeration in a massive graph on a single machine. In: Proceedings of SIGMOD (2016)
Wang, J., Cheng, J.: Truss decomposition in massive networks. Proc. VLDB Endow. 5(9), 812 (2012)
Watts, D.J., Strogatz, S.: Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998)
Acknowledgement
This work was supported by the grants from NSFC 61602395, RGC 12201518, RGC 12232716, RGC 12258116, RGC 12200817, and RGC 12201615.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, Z., Wei, H., Xu, J., Choi, B. (2019). GScan: Exploiting Sequential Scans for Subgraph Matching. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds) Database Systems for Advanced Applications. DASFAA 2019. Lecture Notes in Computer Science(), vol 11448. Springer, Cham. https://doi.org/10.1007/978-3-030-18590-9_69
Download citation
DOI: https://doi.org/10.1007/978-3-030-18590-9_69
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18589-3
Online ISBN: 978-3-030-18590-9
eBook Packages: Computer ScienceComputer Science (R0)