Skip to main content
Log in

Group extraction from professional social network using a new semi-supervised hierarchical clustering

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Recently, social network has been given much attention. This paper addresses the issue of extraction groups from professional social network and enriches the representation of the user profile and its related groups through building a social network warehousing. Several criteria may be applied to detect groups within professional communities, such as the area of expertise, the job openings proposed by the group, the security of the group, and the time of the group creation. In this paper, we aim to find, extract, and fuse the LinkedIn users. Indeed, we deal with the group extraction of LinkedIn users based on their profiles using our innovative semi-supervised clustering method based on quantitative constraints ranking. The encouraging experimental results carried out on our real professional warehouse show the usefulness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. The data warehouse is built using the available information at http://www.linkedin.com.

  2. http://www.cs.waikato.ac.nz/~ml/weka/.

References

  1. Bade K, Hermkes M, Nürnberger A (2007) User oriented hierarchical information organization and retrieval. In: Proceedings of the 18th European conference on machine learning. LNCS, pp 518–526

  2. Basu S, Banerjee A, Mooney RJ (2002) Semi-supervised clustering by seeding. In: International conference on machine learning, pp 27–34

  3. Ben Ahmed E, Nabli A, Gargouri F (2012) SHACUN: semi-supervised hierarchical active clustering based on ranking constraints. In: 12th Industrial conference on data mining, Berlin, Germany, pp 194–208

  4. Benitez E, Collet C, Adiba, Entrepôts de données M (2001) caractéristiques et problématique. Revue TSI 20(2)

  5. Bohm C, Plant C (2008) Hissclu: a hierarchical density-based method for semi-supervised clustering. In: Proceedings of the 11th international conference on extending database technology, New York, NY, USA, pp 440–451

  6. Dasgupta S, Ng V (2010) Which clustering do you want? Inducing your ideal clustering with minimal feedback. J Artif Intell Res 39:581–632

    MATH  MathSciNet  Google Scholar 

  7. Davidson I, Ravi SS (2009) Using instance-level constraints in agglomerative hierarchical clustering: theoretical and empirical results. Data Min Knowl Discov 18(2):257–282

    Google Scholar 

  8. Ghosh Strehl J, Mooney R (2000) Impact of similarity measures on web-page clustering. In: Workshop on artificial intelligence for web search (AAI-2000)

  9. Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Natl Acad Sci USA 99(12):78217826

    Article  MathSciNet  Google Scholar 

  10. Gleiser P, Danon L (2003) Community structure in jazz. Adv Complex Syst 6:565

    Article  Google Scholar 

  11. Golfarelli M, Maio D, Rizzi S (1998) Conceptual design of data warehouses from E/R schemes. In: 31st Hawaii international conference on system sciences

  12. Huang R, Lam W (2009) An active learning framework for semi-supervised document clustering with language modeling. Data Knowl Eng 68(1):49–67

    Article  Google Scholar 

  13. Jaccard P (1908) Nouvelles recherches sur la distribution florale. Bulletin de la Société de Vaudoise des Sciences Naturelles, 44:223–270

    Google Scholar 

  14. Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Upper Saddle River

    MATH  Google Scholar 

  15. Kamvar SD, Klein D, Manning CD (2003) Spectral learning. In: International joint conference on artificial intelligence, pp 561–566

  16. Kestler HA, Kraus JM, Palm G, Schwenker F (2006) On the effects of constraints in semi-supervised hierarchical clustering. Artificial Neural Network Pattern Recognition. LNCS, pp 57–66

  17. Kietzmann HJ, Hermkens K (2011) Social media? Get serious! Understanding the functional building blocks of social media. Bus Horiz 54:241251

    Article  Google Scholar 

  18. Klein D, Kamvar SD, Manning CD (2002) From instance-level constraints to space-level constraints: making the most of prior knowledge in data clustering. In: Proceedings of the 19th international conference on machine learning, CA, USA, pp 307–314

  19. MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: 5th Berkeley symposium on mathematics and statistical probability

  20. Nogueira BM, Jorge AM, Rezende SO (2012) Hierarchical confidence-based active clustering. In: The 27th symposium on applied computing, pp 535–537

  21. Palla G, Barabasi A-L, Vicsek T (2007) Quantifying social group evolution. Nature 446:664–667

    Article  Google Scholar 

  22. Palomares I, Liu J, Xu Y, Martinez L (2012) Modelling experts’ attitudes in group decision making. J Soft Comput 16(10):1755–1766

    Google Scholar 

  23. Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106

    Google Scholar 

  24. Radicchi F, Castellano C, Cecconi F, Loreto V, Parisi D (2004) Defining and identifying communities in networks. Proc Natl Acad Sci 101(2):2658–2663

    Article  Google Scholar 

  25. Traud AL, Kelsic ED, Mucha PJ, Porter MA Community structure in online collegiate social networks [eprint arXiv:0809.0690]

  26. Tung AKH, Han J, Lakshmanan LVS, Ng RT (2001) Constraint-based clustering in large databases. In: Proceedings of the international conference on database theory (ICDT ’01), London, UK

  27. Wagstaff K, Cardie C, Rogers S, Schroedel S (2001) Constrained K-means clustering with background knowledge. In: International conference on machine learning, pp 577–584

  28. Wasserman S, Faust K (1994) Social network analysis: methods and applications. Cambridge University Press, New York

    Book  Google Scholar 

  29. Xing EP, Andrew YNg, Jordan MI, Russell S (2002) Distance metric learning with application to clustering with side-information. In:Advances in neural information processing system pp 505–512

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eya Ben Ahmed.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ben Ahmed, E., Nabli, A. & Gargouri, F. Group extraction from professional social network using a new semi-supervised hierarchical clustering. Knowl Inf Syst 40, 29–47 (2014). https://doi.org/10.1007/s10115-013-0634-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-013-0634-x

Keywords

Navigation