Abstract
Measuring and detecting graph similarities is an important topic with numerous applications. Early algorithms often incur quadratic time or higher, making them unsuitable for graphs of very large scales. Motivated by the cooling process of an object in a thermodynamic system, we devise a new method for measuring graph similarities that can be carried out in linear time. Our algorithm, called Random Walker Termination (RWT), employs a large number of random walkers to capture the structure of a given graph using termination rates in a time sequence. To verify the effectiveness of the RWT algorithm, we use three major graph models, namely, the Erdős-Rényi random graphs, the Watts-Strogatz small-world graphs, and the Barabási-Albert preferential-attachment graphs, to generate graphs of different sizes. We show that the RWT algorithm performs well for graphs generated by these models. Our experiment results agree with the actual similarities of generated graphs. Using self-similarity tests, we show that RWT is sufficiently stable to generate consistent results. We use the graph edge rerouting test and the cross model test to demonstrate that RWT can effectively identify structural similarities between graphs.





Similar content being viewed by others
References
Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74(1):47–97
Cha SH (2007) Comprehensive survey on distance/similarity measures between probability density functions. Int J Math Models Methods Appl Sci 1(4):300–307
Erdős P, Rényi A (1960) On the evolution of random graphs. In: Publication of the Mathematical Institute of the Hungarian Academy of Sciences, pp 17–61
Fang Z, Wang J, Liu B, Gong W (2011) Wikipedia as domain knowledge networks: domain extraction and statistical measurement. In: Proceedings of international conference on knowledge discovery and information retrieval (KDIR 2011), Paris, Oct 2011
Gong W (2009) Will the internet soon outsmart humans? Invited talk at the Department of Computer Science, University of Massachusetts Lowell, March 5, 2009
Jeh G, Widom J (2002) Simrank: a measure of structural-context similarity. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, KDD’02. ACM, New York, pp 538–543
Kondor RI, Lafferty JD (2002) Diffusion kernels on graphs and other discrete input spaces. In: Proceedings of the nineteenth international conference on machine learning, ICML’02, pp 315–322
Lovasz L (1993) Random walks on graphs: a survey. Comb, Paul Erdos is Eighty 2:1–46
Melnik S, Garcia-Molina H, Rahm E (2002) Similarity flooding: a versatile graph matching algorithm and its application to schema matching. In: Proceedings on 18th international conference on data engineering, pp 117–128
Watts D, Strogatz S (1998) Collective dynamics of small-world networks. Nature 393(6684):440–442
Zager LA, Verghese GC (2008) Graph similarity scoring and matching. Appl Math Lett 21(1):86–94
Acknowledgements
We would like to thank Prof. Weibo Gong at University of Massachusetts Ahmerst for providing motivation of the RWT algorithm.
Author information
Authors and Affiliations
Corresponding author
Additional information
The first author was supported in part by the NSF under grant CCF-0830314. The second author was supported in part by the NSF under grants CCF-0830314, CNS-0958477, and CNS-1018422.
Rights and permissions
About this article
Cite this article
Fang, Z., Wang, J. Efficient identifications of structural similarities for graphs. J Comb Optim 27, 209–220 (2014). https://doi.org/10.1007/s10878-012-9505-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10878-012-9505-8