Skip to main content
Log in

Fast graph similarity search via hashing and its application on image retrieval

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Similarity search in graph databases has been widely investigated. It is worthwhile to develop a fast algorithm to support similarity search in large-scale graph databases. In this paper, we investigate a k-NN (k-Nearest Neighbor) similarity search problem by locality sensitive hashing (LSH). We propose an innovative fast graph search algorithm named LSH-GSS, which first transforms complex graphs into vectorial representations based on prototypes in the database and later accelerates a query in Euclidean space by employing LSH. Because images can be represented as attributed graphs, we propose an approach to transform attributed graphs into n-dimensional vectors and apply LSH-GSS to execute further image retrieval. Experiments on three real graph datasets and two image datasets show that our methods are highly accurate and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Baeza-Yates R, Valiente G (2000) An image similarity measure based on graph matching. In: Proceedings of the seventh international symposium on string processing and information retrieval, 2000. SPIRE 2000. IEEE, pp 28–38

  2. Chatzichristofis SA, Boutalis YS (2008) Cedd: color and edge directivity descriptor: a compact descriptor for image indexing and retrieval. Comput Vision Syst 5008:312–322

    Article  Google Scholar 

  3. Cheng J, Ke Y, Ng W, Lu A (2007) Fg-index: towards verification-free query processing on graph databases. In: Proceedings of the 2007 ACM SIGMOD international conference on management of data. ACM, pp 857–872

  4. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal 24(5):603–619

    Article  Google Scholar 

  5. Dasgupta A, Kumar R, Sarlos T (2011) Fast locality-sensitive hashing. In: ACM SIGKDD International conference on knowledge discovery and data mining. San Diego, CA, USA, August, pp 1073–1081

  6. Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on computational geometry. ACM, pp 253–262

  7. Deng C, Liu X, Mu Y, Li J (2015) Large-scale multi-task image labeling with adaptive relevance discovery and feature hashing. Signal Process 112:137–145

    Article  Google Scholar 

  8. Doggaz N, Ferjani I (2011) Image segmentation using normalized cuts and efficient graph-based segmentation. Springer, Berlin

    Book  Google Scholar 

  9. Fan J, Yau DY, Elmagarmid AK, Aref WG (2001) Automatic image segmentation by integrating color-edge extraction and seeded region growing. IEEE Trans Image Process 10(10):1454–1466

    Article  MATH  Google Scholar 

  10. Felzenszwalb PF, Huttenlocher DP (2004) Efficient graph-based image segmentation. Int J Comput Vis 59(2):167–181

    Article  Google Scholar 

  11. Fischer B, Thies CJ, Guld MO, Lehmann TM (2004) Content-based image retrieval by matching hierarchical attributed region adjacency graphs. In: Medical imaging 2004, international society for optics and photonics, pp 598–606

  12. Gao X, Xiao B, Tao D, Li X (2010) A survey of graph edit distance. Pattern Anal Applic 13(1):113–129

    Article  MathSciNet  Google Scholar 

  13. Ge T, He K, Ke Q, Sun J (2013) Optimized product quantization. IEEE Trans Pattern Anal 36(4):744–755

    Article  Google Scholar 

  14. He H, Singh AK (2006) Closure-tree: an index structure for graph queries. In: 22nd international conference on data engineering (ICDE’06). IEEE, pp 38–38

  15. Ji T, Liu X, Deng C, Huang L, Lang B (2014) Query-adaptive hash code ranking for fast nearest neighbor search. ACM International Conference on Multimedia, pp 1005–1008

  16. Jiang H, Wang H, Philip S Y, Zhou S (2007) Gstring: a novel approach for efficient search in graph databases. In: 2007 IEEE 23rd international conference on data engineering. IEEE, pp 566–575

  17. Jouili S, Tabbone S (2009) Graph matching based on node signatures. In: International workshop on graph-based representations in pattern recognition. Springer, pp 154–163

  18. Jouili S, Tabbone S (2012) Hypergraph-based image retrieval for graph-based representation. Pattern Recogn 45(11):4054–4068

    Article  Google Scholar 

  19. Kailing K, Kriegel HP, Schönauer S (2004) Content-based image retrieval using multiple representations. In: International conference on knowledge-based and intelligent information and engineering systems. Springer, pp 982–988

  20. Koch I (2001) Enumerating all connected maximal common subgraphs in two graphs. Theor Comput Sci 250(1):1–30

    Article  MathSciNet  MATH  Google Scholar 

  21. Li CY, Hsu CT (2005) Soft region correspondence estimation for graph-theoretic image retrieval using quadratic programming approach. In: 2005 IEEE international conference on multimedia and expo. IEEE, pp 1416–1419

  22. Li CY, Hsu CT (2008) Image retrieval with relevance feedback based on graph-theoretic region correspondence estimation. IEEE Trans Multimedia 10(3):447–456

    Article  Google Scholar 

  23. Li Y, Xu Y, Wang J, Miao Z, Zhang Y (2017) Ms-rmac: Multiscale regional maximum activation of convolutions for image retrieval. IEEE Signal Process Lett 24(5):609–613

    Article  Google Scholar 

  24. Liu X, He J, Lang B (2013) Reciprocal hash tables for nearest neighbor search. In: AAAI

  25. Liu X, He J, Deng C, Lang B (2014) Collaborative hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2139–2146

  26. Liu X, Huang L, Deng C, Lu J, Lang B (2015) Multi-view complementary hash tables for nearest neighbor search. IEEE International Conference on Computer Vision, pp 1107–1115

  27. Lv Q, Josephson W, Wang Z, Charikar M, Li K (2007) Multi-probe lsh: efficient indexing for high-dimensional similarity search. In: Proceedings of the 33rd international conference on very large data bases. VLDB Endowment, pp 950–961

  28. Mongiovi M, Di Natale R, Giugno R, Pulvirenti A, Ferro A, Sharan R (2010) Sigma: a set-cover-based inexact graph matching algorithm. J Bioinforma Comput Biol 8(2):199–218

    Article  Google Scholar 

  29. Mu Y, Chen X, Liu X, Chua T, Yan S (2012) Multimedia semantics-aware query-adaptive hashing with bits reconfigurability. Intern J Multimedia Inform Retrieval 1(1):59–70

    Article  Google Scholar 

  30. Niepert M, Ahmed M, Kutzkov K (2016) Learning convolutional neural networks for graphs. International Conference on Machine Learning, pp 2014–2023

  31. Regneri M (2007) Finding all cliques of an undirected graph. In: Seminar current trends in IE WS jun

  32. Riesen K, Bunke H (2008) Iam graph database repository for graph based pattern recognition and machine learning. In: Joint IAPR international workshops on statistical techniques in pattern recognition (SPR) and structural and syntactic pattern recognition (SSPR). Springer, pp 287–297

  33. Riesen K, Bunke H (2009) Approximate graph edit distance computation by means of bipartite graph matching. Image Vis Comput 27(7):950–959

    Article  Google Scholar 

  34. Riesen K, Bunke H (2009) Graph classification based on vector space embedding. Int J Pattern Recognit Artif Intell 23(6):1053–1081

    Article  MATH  Google Scholar 

  35. Shervashidze N, Borgwardt K M (2009) Fast subtree kernels on graphs. In: Advances in neural information processing systems, pp 1660–1668

  36. Tabei Y, Tsuda K (2011) Kernel-based similarity search in massive graph databases with wavelet trees. In: SDM. SIAM, pp 154–163

  37. Wale N, Watson IA, Karypis G (2008) Comparison of descriptor spaces for chemical compound retrieval and classification. Knowl Inf Syst 14(3):347–375

    Article  Google Scholar 

  38. Wang G, Wang B, Yang X, Yu G (2012) Efficiently indexing large sparse graphs for similarity search. IEEE Trans Knowl Data Eng 24(3):440–451

    Article  Google Scholar 

  39. Wang X, Ding X, Tung AK, Ying S, Jin H (2012) An efficient graph indexing method. In: 2012 IEEE 28th international conference on data engineering. IEEE, pp 210–221

  40. Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Bryant SH (2009) Pubchem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res 37(suppl 2):W623–W633

    Article  Google Scholar 

  41. Yan X, Yu PS, Han J (2004) Graph indexing: a frequent structure-based approach. In: Proceedings of the 2004 ACM SIGMOD international conference on management of data. ACM, pp 335–346

  42. Yan X, Yu PS, Han J (2005) Substructure similarity search in graph databases. In: Proceedings of the 2005 ACM SIGMOD international conference on management of data. ACM, pp 766–777

  43. Zhang B, Liu X, Lang B (2015) Fast graph similarity search via locality sensitive hashing. In: Pacific rim conference on multimedia. Springer, pp 623–633

  44. Zhao P, Yu JX, Yu P S (2007) Graph indexing: tree + delta <= graph. In: Proceedings of the 33rd international conference on very large data bases. VLDB Endowment, pp 938–949

Download references

Acknowledgements

This work is supported in part by the National Natural Science Foundation of China (61370125 and 61402026), SKLSDE-2017ZX-03.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bo Wu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lang, B., Wu, B., Liu, Y. et al. Fast graph similarity search via hashing and its application on image retrieval. Multimed Tools Appl 77, 16177–16198 (2018). https://doi.org/10.1007/s11042-017-5194-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-017-5194-8

Keywords

Navigation