SocNL: Bayesian Label Propagation with Confidence

Yamaguchi, Yuto; Faloutsos, Christos; Kitagawa, Hiroyuki

doi:10.1007/978-3-319-18038-0_49

Yuto Yamaguchi¹⁰,
Christos Faloutsos¹¹ &
Hiroyuki Kitagawa¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9077))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

3643 Accesses
2 Citations

Abstract

How can we predict Smith’s main hobby if we know the main hobby of Smith’s friends? Can we measure the confidence in our prediction if we are given the main hobby of only a few of Smith’s friends? In this paper, we focus on how to estimate the confidence on the node classification problem. Providing a confidence level for the classification problem is important because most nodes in real world networks tend to have few neighbors, and thus, a small amount of evidence. Our contributions are three-fold: (a) novel algorithm; we propose a semi-supervised learning algorithm that converges fast, and provides the confidence estimate (b) theoretical analysis; we show the solid theoretical foundation of our algorithm and the connections to label propagation and Bayesian inference (c) empirical analysis; we perform extensive experiments on three different real networks. Specifically, the experimental results demonstrate that our algorithm outperforms other algorithms on graphs with less smoothness and low label density.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adamic, L.A., Glance, N.: The political blogosphere and the 2004 us election: divided they blog. In: Proceedings of the 3rd International Workshop on Link Discovery, pp. 36–43. ACM (2005)
Google Scholar
Aggarwal, C.C., Li, N.: On node classification in dynamic content-based networks. In: SDM, pp. 355–366. SIAM (2011)
Google Scholar
Baluja, S., Seth, R., Sivakumar, D., Jing, Y., Yagnik, J., Kumar, S., Ravichandran, D., Aly, M.: Video suggestion and discovery for youtube: taking random walks through the view graph. In: WWW, pp. 895–904. ACM (2008)
Google Scholar
Belkin, M., Niyogi, P., Sindhwani, V.: Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. The Journal of Machine Learning Research 7, 2399–2434 (2006)
MATH MathSciNet Google Scholar
Chapelle, O., Schölkopf, B., Zien, A., et al.: Semi-supervised learning, vol. 2. MIT Press, Cambridge (2006)
Google Scholar
Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. In: SIGCOMM, pp. 251–262 (1999)
Google Scholar
Fang, Y., Hsu, B.-J.P., Chang, K.C.-C.: Confidence-aware graph regularization with heterogeneous pairwise features. In: SIGIR, pp. 951–960. ACM (2012)
Google Scholar
Gong, C., Tao, D., Fu, K., Yang, J.: Relish: Reliable label inference via smoothness hypothesis. In: AAAI (2014)
Google Scholar
McGlohon, M., Bay, S., Anderle, M.G., Steier, D.M., Faloutsos, C.: Snare: a link analytic system for graph labeling and risk detection. In: KDD, pp. 1265–1274. ACM (2009)
Google Scholar
Mislove, A., Viswanath, B., Gummadi, K.P., Druschel, P.: You are who you know: inferring user profiles in online social networks. In: WSDM, pp. 251–260. ACM (2010)
Google Scholar
Orbach, M., Crammer, K.: Graph-based transduction with confidence. In: Flach, P.A., De Bie, T., Cristianini, N. (eds.) ECML PKDD 2012, Part II. LNCS, vol. 7524, pp. 323–338. Springer, Heidelberg (2012)
Chapter Google Scholar
Sun, Y., Han, J., Gao, J., Yu, Y.: itopicmodel: Information network-integrated topic modeling. In: ICDM, pp. 493–502. IEEE (2009)
Google Scholar
Takac, L., Zabovsky, M.: Data analysis in public social networks. In: International Scientific Conference and International Workshop Present Day Trends of Innovations (2012)
Google Scholar
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. Advances in Neural Information Processing Systems 16(16), 321–328 (2004)
Google Scholar
Zhu, X., Ghahramani, Z., Lafferty, J., et al.: Semi-supervised learning using gaussian fields and harmonic functions. In: ICML, vol. 3, pp. 912–919 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Tsukuba, Tsukuba, Japan
Yuto Yamaguchi & Hiroyuki Kitagawa
Carnegie Mellon University, Pittsburgh, USA
Christos Faloutsos

Authors

Yuto Yamaguchi
View author publications
You can also search for this author in PubMed Google Scholar
Christos Faloutsos
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Kitagawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuto Yamaguchi .

Editor information

Editors and Affiliations

Ho Chi Minh City University of Technology, Ho Chi Minh City, Vietnam
Tru Cao
Singapore Management University, Singapore, Singapore
Ee-Peng Lim
Nanjing University, Nanjing, China
Zhi-Hua Zhou
Japan Advanced Institute of Science and Technology, Nomi City, Japan
Tu-Bao Ho
University of Hong Kong, Hong Kong, Hong Kong SAR
David Cheung
Osaka University, Osaka, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yamaguchi, Y., Faloutsos, C., Kitagawa, H. (2015). SocNL: Bayesian Label Propagation with Confidence. In: Cao, T., Lim, EP., Zhou, ZH., Ho, TB., Cheung, D., Motoda, H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2015. Lecture Notes in Computer Science(), vol 9077. Springer, Cham. https://doi.org/10.1007/978-3-319-18038-0_49

Download citation

DOI: https://doi.org/10.1007/978-3-319-18038-0_49
Published: 17 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18037-3
Online ISBN: 978-3-319-18038-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics