Abstract
Previous work in network analysis has focused on modeling the roles of nodes in graphs. In this paper, we introduce edge role discovery and propose a framework for learning and extracting edge roles from large graphs. We also propose a general class of higher-order role models that leverage network motifs. This leads us to develop a novel edge feature learning approach for role discovery that begins with higher-order network motifs and automatically learns deeper edge features. All techniques are parallelized and shown to scale well. They are also efficient with a time complexity of \(\mathcal {O}(|E|)\). The experiments demonstrate the effectiveness of our model for a variety of ML tasks such as improving classification and dynamic network analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
4-vertex induced subgraphs (graphlets, motifs) and larger.
- 2.
The representation cost of correcting approximation errors.
- 3.
Note \(\log _2(m)\) quantization bins are used.
- 4.
We note that MDL is used in Fig. 1, though AIC/BIC gave similar results.
References
Ahmed, N.K., Neville, J., Rossi, R.A., Duffield, N.: Efficient graphlet counting for large networks. In: ICDM, p. 10 (2015)
Ahmed, N.K., Neville, J., Rossi, R.A., Duffield, N., Willke, T.L.: Graphlet decomposition: framework, algorithms, and applications. KAIS 50(3), 1–32 (2016)
Ahmed, N.K., Willke, T.L., Rossi, R.A.: Estimation of local subgraph counts. In: IEEE BigData, pp. 1–10 (2016)
Akaike, H.: A new look at the statistical model identification. TOAC 19(6), 716–723 (1974)
Anderson, C., Wasserman, S., Faust, K.: Building stochastic blockmodels. Soc. Netw. 14(1), 137–161 (1992)
Arabie, P., Boorman, S., Levitt, P.: Constructing blockmodels: how and why. J. Math. Psychol. 17(1), 21–63 (1978)
Batagelj, V., Mrvar, A., Ferligoj, A., Doreian, P.: Generalized blockmodeling with pajek. Metodoloski Zvezki 1, 455–467 (2004)
Bennett, W.R.: Spectra of quantized signals. Bell Syst. Tech. 27(3), 446–472 (1948)
Borgatti, S., Everett, M., Johnson, J.: Analyzing Social Networks. SAGE Publications, Thousand Oaks (2013)
Bregman, L.M.: The relaxation method of finding the common point of convex sets. USSR Comput. Math. Math. Phys. 7(3), 200–217 (1967)
Doreian, P., Batagelj, V., Ferligoj, A.: Generalized Blockmodeling, vol. 25. Cambridge University Press, Cambridge (2005)
Getoor, L., Taskar, B. (eds.): Introduction to Statistical Relational Learning. MIT Press, Cambridge (2007)
Grünwald, P.D.: The Minimum Description Length Principle. MIT Press, Cambridge (2007)
Henderson, K., et al.: Rolx: structural role extraction & mining in large graphs. In: KDD, pp. 1231–1239 (2012)
Holland, P.W., Laskey, K.B., Leinhardt, S.: Stochastic blockmodels: first steps. Soc. Netw. 5(2), 109–137 (1983)
Huffman, D.A., et al.: A method for the construction of minimum-redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)
Li, M., Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications. Springer Science & Business Media, Heidelberg (2009)
Lloyd, S.: Least squares quantization in PCM. TOIT 28(2), 129–137 (1982)
Lorrain, F., White, H.: Structural equivalence of individuals in social networks. J. Math. Sociol. 1(1), 49–80 (1971)
Macskassy, S., Provost, F.: A simple relational classifier. In: KDD MRDM (2003)
Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. JMLR 8, 935–983 (2007)
Max, J.: Quantizing for minimum distortion. TOIT 6(1), 7–12 (1960)
Nowicki, K., Snijders, T.: Estimation and prediction for stochastic blockstructures. J. Am. Stat. Assoc. 96(455), 1077–1087 (2001)
Oliver, B., Pierce, J., Shannon, C.E.: The philosophy of PCM. IRE 36(11), 1324–1331 (1948)
Rahman, M., Hasan, M.A.: Link prediction in dynamic networks using graphlet. In: Frasconi, P., Landwehr, N., Manco, G., Vreeken, J. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9851, pp. 394–409. Springer, Cham (2016). doi:10.1007/978-3-319-46128-1_25
Rissanen, J.: Modeling by shortest data description. Automatica 14(5), 465–471 (1978)
Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: AAAI (2015). http://networkrepository.com
Rossi, R.A., Ahmed, N.K.: Role discovery in networks. TKDE 27(4), 1112 (2015)
Rossi, R.A., Gallagher, B., Neville, J., Henderson, K.: Role-dynamics: fast mining of large dynamic networks. In: WWW Companion, pp. 997–1006 (2012)
Rossi, R.A., Gallagher, B., Neville, J., Henderson, K.: Modeling dynamic behavior in large evolving graphs. In: WSDM, pp. 667–676 (2013)
Rossi, R.A., McDowell, L.K., Aha, D.W., Neville, J.: Transforming graph data for statistical relational learning. JAIR 45(1), 363–441 (2012)
Rossi, R.A., Zhou, R.: Parallel collective factorization for modeling large heterogeneous networks. Soc. Netw. Anal. Mining 6(1), 30 (2016)
Schwarz, G., et al.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)
Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. 27(1), 379–423 (1948)
Van Leeuwen, J.: On the construction of Huffman trees. In: ICALP, p. 382 (1976)
Vishwanathan, S.V.N., Schraudolph, N.N., Kondor, R., Borgwardt, K.M.: Graph kernels. JMLR 11, 1201–1242 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Ahmed, N.K., Rossi, R.A., Willke, T.L., Zhou, R. (2017). Edge Role Discovery via Higher-Order Structures. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10234. Springer, Cham. https://doi.org/10.1007/978-3-319-57454-7_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-57454-7_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57453-0
Online ISBN: 978-3-319-57454-7
eBook Packages: Computer ScienceComputer Science (R0)