Abstract
In the real world, many complex systems consist of a large number of interacting groups of entities. A hypergraph consists of vertices and hyperedges that can connect multiple vertices. Since hypergraphs can effectively simulate complex intergroup relationships among entities, they have a wide range of applications such as computer vision and bioinformatics. In this paper, we study the subhypergraph containment query problem which is one of the most basic problems in the processing of hypergraphs. Existing methods on the subgraph query are designed for ordinary graphs and do not consider hypergraph features. If they are directly applied to subhypergraph containment query, they will suffer from hyperedge semantic incompleteness and label diversity sensitivity issues, resulting in inefficient algorithm performance. This motivates us to improve the performance by exploiting hyperedge features. In our work, we propose a novel framework for subhypergraph containment query called hyperedge filtering vertex testing. Based on the features of hypergraph, we propose an efficient filtering algorithm that can reduce the cost of the traditional filtering stage. In addition, we further propose efficient isomorphism testing techniques based on hyperedge vertex candidates to improve the performance. Extensive experiments on real datasets validate the superiority of our algorithm compared to existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berge, C.: Hypergraphs: Combinatorics of Finite Sets, vol. 45. Elsevier, Amsterdam (1984)
Berman, H.M., et al.: The protein data bank. Nucl. Acids Res. 28(1), 235–242 (2000)
Bi, F., Chang, L., Lin, X., Qin, L., Zhang, W.: Efficient subgraph matching by postponing cartesian products. In: Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data, pp. 1199–1214 (2016)
Bonnici, V., Ferro, A., Giugno, R., Pulvirenti, A., Shasha, D.: Enhancing graph database indexing by suffix tree structure. In: Dijkstra, T.M.H., Tsivtsivadze, E., Marchiori, E., Heskes, T. (eds.) PRIB 2010. LNCS, vol. 6282, pp. 195–203. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-16001-1_17
Bretto, A., Cherifi, H., Aboutajdine, D.: Hypergraph imaging: an overview. Pattern Recogn. 35(3), 651–658 (2002)
Bunke, H., Dickinson, P., Kraetzl, M., Neuhaus, M., Stettler, M.: Matching of hypergraphs: algorithms, applications, and experiments. In: Bunke, H., Kandel, A., Last, M. (eds) Applied Pattern Recognition, vol. 91, pp. 131–154. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-76831-9_6
Cheng, J., Ke, Y., Ng, W., Lu, A.: FG-Index: towards verification-free query processing on graph databases. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp. 857–872 (2007)
Giugno, R., Bonnici, V., Bombieri, N., Pulvirenti, A., Ferro, A., Shasha, D.: Grapes: a software for parallel searching on biological graphs targeting multi-core architectures. PLoS ONE 8(10), e76911 (2013)
Ha, T.W., Seo, J.H., Kim, M.H.: Efficient searching of subhypergraph isomorphism in hypergraph databases. In: IEEE International Conference on Big Data and Smart Computing (2018)
He, H., Singh, A.K.: Graphs-at-a-time: query language and access methods for graph databases. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 405–418 (2008)
Hwang, T.H., Tian, Z., Kuang, R., Kocher, J.P.: Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction. In: Eighth IEEE International Conference on Data Mining (2008)
Katsarou, F., Ntarmos, N., Triantafillou, P.: Hybrid algorithms for subgraph pattern queries in graph databases. In: 2017 IEEE International Conference on Big Data (Big Data), pp. 656–665. IEEE (2017)
Klein, K., Kriege, N., Mutzel, P.: CT-Index: fingerprint-based graph indexing combining cycles and trees. In: 2011 IEEE 27th International Conference on Data Engineering, pp. 1115–1126. IEEE (2011)
Knoke, D., Yang, S.: Social Network Analysis. Sage Publications, Thousand Oaks (2019)
Ramadan, E., Tarafdar, A., Pothen, A.: A hypergraph model for the yeast protein complex network. In: 18th International Parallel and Distributed Processing Symposium. Proceedings, p. 189. IEEE (2004)
Shang, H., Zhang, Y., Lin, X., Yu, J.X.: Taming verification hardness: an efficient algorithm for testing subgraph isomorphism. Proc. VLDB Endow. 1(1), 364–375 (2008)
Su, Y., Gu, Y., Wang, Z., Zhang, Y., Qin, J., Yu, G.: Efficient subhypergraph matching based on hyperedge features. IEEE Transactions on Knowledge and Data Engineering (2022)
Sun, S., Luo, Q.: Scaling up subgraph query processing with efficient subgraph matching. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 220–231. IEEE (2019)
Wong, A.K.C., Lu, S.W.: Recognition and shape synthesis of 3-d objects based on attributed hypergraphs. IEEE Trans. Pattern Anal. Mach. Intell. 11(3), 279–290 (1989)
Yuan, D., Mitra, P.: Lindex: a lattice-based index for graph databases. VLDB J. 22(2), 229–252 (2013)
Zhang, H., Xie, X., Wen, Y., Zhang, Y.: A twig-based algorithm for top-k subgraph matching in large-scale graph data. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 475–487. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_43
Acknowledgements
This work is supported by the National Nature Science Foundation of China (62072083) and the Fundamental Research Funds of the Central Universities (N2216017).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Su, Y., Song, Y., Li, X., Li, F., Gu, Y. (2022). Efficient Subhypergraph Containment Queries on Hypergraph Databases. In: Zhao, X., Yang, S., Wang, X., Li, J. (eds) Web Information Systems and Applications. WISA 2022. Lecture Notes in Computer Science, vol 13579. Springer, Cham. https://doi.org/10.1007/978-3-031-20309-1_44
Download citation
DOI: https://doi.org/10.1007/978-3-031-20309-1_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20308-4
Online ISBN: 978-3-031-20309-1
eBook Packages: Computer ScienceComputer Science (R0)