Abstract
We consider here the task of multi-label classification for data organized in a multi-relational graph. We propose the IMMCA model—Iterative Multi-label Multi-relational Classification Algorithm—a general algorithm for solving the inference and learning problems for this task. Inference is performed iteratively by propagating scores according to the multi-relational structure of the data. We detail two instances of this general model, implementing two different label propagation schemes on the multigraph. To the best of our knowledge, this is the first collective classification method able to handle multiple relations and to perform multi-label classification in multigraphs. Tests are performed for two generic applications, image annotation and document classification, on different social datasets. For image annotation, we have been using Flickr datasets of different sizes and with different configurations, with multiple relations such as authorship, friendship, or textual similarities. For document classification, we used the Cora classical benchmark plus an Email corpus. Additional experiments on artificial data allow us to analyze further the behavior of the model.
Similar content being viewed by others
Notes
We consider that l ≪ u and l + u = N.
It has been initially proposed for classifying the nodes of a fully unlabeled graph, using a set of labeled graphs for training, and then extended to partially labeled graphs.
Note that a ranking model has also been formulated but is not presented here.
Other combinations of propagation and content have been tested and we present here the simplest and most efficient one.
References
Abernethy J, Chapelle O, Castillo C (2008) Web spam identification through content and hyperlinks. In: Proceedings of AIRWeb ’08. ACM, New York, NY, USA. doi:10.1145/1451983.1451994
Agarwal S (2006) Ranking on graph data. In: Proceedings of the 23rd international conference on machine learning ICML ’06. ACM, New York, NY, USA, pp 25–32
Argyriou A, Herbster M, Pontil M (2005) Combining graph Laplacians for semi-supervised learning. In: Proceedings on advances in neural information processing systems, vol 18, pp 67–74
Benchettara N, Kanawati R, Rouveirol C (2010) Supervised machine learning applied to link prediction in bipartite social networks. In: Proceedings of ASONAM, pp 326–330
Bhagat S, Cormode G, Rozenbaum I (2009) Applying link-based classification to label blogs. In: Zhang H, Spiliopoulou M, Mobasher B, Giles CL, Mccallum A, Nasraoui O, Srivastava J, Yen J (eds) Advances in web mining and web usage analysis. Springer, Berlin, pp 97–117
Bhattacharyya P, Garg A, Wu S (2010) Analysis of user keyword similarity in online social networks. Social Netw Anal Min 1–16. doi:10.1007/s13278-010-0006-4
Bilgic M, Namata G, Getoor L (2007) Combining collective classification and link prediction. In: Proceedings of ICDM workshops, pp 381–386
Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of community hierarchies in large networks. CoRR, abs/0803.0476
Cao L, Luo J, Huang TS (2008) Annotating photo collections by label propagation according to multiple similarity cues. In: Proceedings of MM ’08
Chen H, Li L, Peng J (2009) Error bounds of multi-graph regularized semi-supervised classification. Inform Sci 179(12):1960–1969 (Special Section: Web Search)
Clauset A, Moore C, Newman MEJ (2006) Structural inference of hierarchies in networks. CoRR, abs/physics/0610051
Denoyer L, Gallinari P (2010) A ranking based model for automatic image annotation in a social network. Tech. rep. University Pierre et Marie Curie
Esslimani I, Brun A, Boyer A (2010) Densifying a behavioral recommender system by social networks link prediction methods. Social Netw Anal Min 1–14. doi:10.1007/s13278-010-0004-6
Ghosh R, Lerman K (2008) Community detection using a measure of global influence. In: Proceedings of SNAKDD, pp 20–35
Gjoka M, Butts CT, Kurant M, Markopoulou A (2010) Multigraph sampling of online social networks. CoRR, abs/1008.2565
Hill S, Provost FJ, Volinsky C (2007) Learning and inference in massive social networks. In: Proceedings of MLG
Jensen D, Neville J, Gallagher B (2004) Why collective inference improves relational classification. In: Proceedings of KDD ’04. ACM, New York, NY, USA, pp 593–598
Kato T, Kashima H, Sugiyama M (2008) Integration of multiple networks for robust label propagation. In: Proceedings of SDM, pp 716–726
Kayaalp M, Zyer T, Zyer S (2010) A mash-up application utilizing hybridized filtering techniques for recommending events at a social networking site. Social Netw Anal Min 1–9. doi:10.1007/s13278-010-0010-8
Kleinberg JM (1999) Hubs, authorities, and communities. ACM Comput Surv 31
Kou Z, Cohen WW (2007) Stacked graphical models for efficient inference in markov random fields. In: Proceedings of SDM
Li W (2008) Real-time computerized annotation of pictures. In: Proceedings of IEEE transactions on pattern analysis and machine intelligence, pp 985–1002
Li Y, Zaragoza H, Herbrich R, Shawe-Taylor J, Kandola JS (2002) The perceptron algorithm with uneven margins. In: Proceedings of ICML, pp 379–386
Liu Y, Mei T, Wu X, Hua XS (2009) Multigraph-based query-independent learning for video search. IEEE Trans Circuits Syst Video Technol 19:1841–1850
Ma H, Zhou D, Liu C, Lyu MR, King I (2011) Recommender systems with social regularization. In: Proceedings of the fourth ACM international conference on web search and data mining, WSDM ’11. ACM, New York, NY, USA, pp 287–296
Macskassy S, Provost F (2003) A simple relational classifier. In: Proceedings of the ninth ACM SIGKDD international conference on knowledge discovery and data mining, second workshop on multi-relational data mining
Maes F, Peters S, Denoyer L, Gallinari P (2009) Simulated iterative classification a new learning procedure for graph labeling. In: Proceedings of ECML/PKDD, no. 2, pp 47–62
Magnani M, Montesi D, Rossi L (2010) Information propagation analysis in a social network site. In: Proceedings of ASONAM, pp 296–300
Monay F, Gatica-Perez D (2003) On image auto-annotation with latent space models. In: Proceedings of MULTIMEDIA ’03. ACM, New York, NY, USA, pp 275–278. doi:10.1145/957013.957070
Rosenblatt F (1958) The perceptron: a probabilistic model for information storage and organization in the brain. Psychol Rev 65(6):386–408
Saravanan M, Prasad G, Karishma S, Suganthi D (2011) Analyzing and labeling telecom communities using structural properties. Social Netw Anal Min 1–16. doi:10.1007/s13278-011-0020-1
Sen P, Namata GM, Bilgic M, Getoor L, Gallagher B, Eliassi-Rad T (2008) Collective classification in network data. Tech. Rep. CS-TR-4905. University of Maryland, College Park
Sigurbjörnsson B, van Zwol R (2008) Flickr tag recommendation based on collective knowledge. In: Proceedings of WWW ’08. ACM, New York, NY, USA, pp 327–336
Vapnik VN (1995) The nature of statistical learning theory. Springer-Verlag New York, Inc., New York, NY, USA
Wang M, Hua XS, Hong R, Tang J, Qi GJ, Song Y (2009) Unified video annotation via multigraph learning. IEEE Trans Circuits Syst Video Technol 19:733–746
Wang M, Hua XS, Yuan X, Song Y, Dai LR (2007) Optimizing multi-graph learning: towards a unified video annotation scheme. In: Proceedings of the 15th international conference on multimedia, MULTIMEDIA ’07. ACM, New York, NY, USA, pp 862–871
Wu L, Yang L, Yu N, Hua XS (2009) Learning to tag. In: Proceedings of WWW, pp 361–370
Zhou D, Bousquet O, Lal TN, Weston J, Schölkopf B (2003) Learning with local and global consistency. In: Proceedings of NIPS
Zhou D, Huang J, Schölkopf B (2005) Learning from labeled and unlabeled data on a directed graph. In: Proceedings of ICML, pp 1036–1043
Zhu X, Ghahramani Z, Lafferty JD (2003) Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of ICML
Acknowledgments
This work was partially supported by the French National Agency of Research (Fragrances, ANR-08-CORD-008-01 and ExDeus/Cedres, ANR-09-CORD-010-04, Projects).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Peters, S., Jacob, Y., Denoyer, L. et al. Iterative Multi-label Multi-relational Classification Algorithm for complex social networks. Soc. Netw. Anal. Min. 2, 17–29 (2012). https://doi.org/10.1007/s13278-011-0034-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13278-011-0034-8