skip to main content
10.1145/3132847.3133026acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

BL-ECD: Broad Learning based Enterprise Community Detection via Hierarchical Structure Fusion

Published: 06 November 2017 Publication History

Abstract

Employees in companies can be divided into different social communities, and those who frequently socialize with each other will be treated as close friends and are grouped in the same community. In the enterprise context, a large amount of information about the employees is available in both (1) offline company internal sources and (2) online enterprise social networks (ESNs). Each of the information sources also contain multiple categories of employees' socialization activities at the same time. In this paper, we propose to detect the social communities of the employees in companies based on the broad learning setting with both these online and offline information sources simultaneously, and the problem is formally called the "Broad Learning based Enterprise Community Detection" (BL-ECD) problem. To address the problem, a novel broad learning based community detection framework named "HeterogeneoUs Multi-sOurce ClusteRing" (HUMOR) is introduced in this paper. Based on the various enterprise social intimacy measures introduced in this paper, HUMOR detects a set of micro community structures of the employees based on each of the socialization activities respectively. To obtain the (globally) consistent community structure of employees in the company, HUMOR further fuses these micro community structures via two broad learning phases: (1) intra-fusion of micro community structures to obtain the online and offline (locally) consistent communities respectively, and (2) inter-fusion of the online and offline communities to achieve the (globally) consistent community structure of employees. Extensive experiments conducted on real-world enterprise datasets demonstrate our method can perform very well in addressing the BL-ECD problem.

References

[1]
E. Amigó, J. Gonzalo, J. Artiles, and F. Verdejo. A comparison of extrinsic clustering evaluation metrics based on formal constraints. Information Retrieval, 2009.
[2]
J. Banfield and A. Raftery. Model-based Gaussian and non-Gaussian clustering. Biometrics, 1993.
[3]
S. Basu, M. Bilenko, and R. Mooney. A probabilistic framework for semi-supervised clustering. In KDD, 2004.
[4]
P. Cimiano, A. Hotho, and S. Staab. Comparing conceptual, divise and agglomerative clustering for learning taxonomies from text. In ECAI, 2004.
[5]
D. Coppersmith and S. Winograd. Matrix multiplication via arithmetic progressions. Journal of Symbolic Computation, 1990.
[6]
D. Davies and D. Bouldin. A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1979.
[7]
K. Ehrlich, C. Lin, and V. Griffiths-Fisher. Searching for experts in the enterprise: Combining text and social network analysis. In GROUP, 2007.
[8]
T. Hastie, R. Tibshirani, and J. Friedman. Hierarchical clustering. In The Elements of Statistical Learning. 2009.
[9]
F. Hopner, F. Hoppner, and F. Klawonn. Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. 1999.
[10]
Q. Hu, S. Xie, J. Zhang, Q. Zhu, S. Guo, and P. Yu. Heterosales: Utilizing heterogeneous social networks to identify the next enterprise customer. In WWW, 2016.
[11]
Z. Huang. Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Mining Knowledge Discovery, 1998.
[12]
S. Jin, J. Zhang, P. Yu, S. Yang, and A. Li. Synergistic partitioning in multiple large scale social networks. In IEEE BigData, 2014.
[13]
T. Joachims. A probabilistic analysis of the rocchio algorithm with tfidf for text categorization. In ICML, 1997.
[14]
B. King. Step-Wise Clustering Procedures. Journal of The American Statistical Association, 1967.
[15]
W. Lin, X. Kong, P. Yu, Q. Wu, Y. Jia, and C. Li. Community detection in incomplete information networks. In WWW, 2012.
[16]
M. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review, 2004.
[17]
N. Nguyen and R. Caruana. Consensus clusterings. In Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on, 2007.
[18]
H. Park and C. Jun. A simple and fast algorithm for k-medoids clustering. Expert Systems with Applications, 2009.
[19]
N. Park and W. Lee. Statistical grid-based clustering over data streams. SIGMOD Record, 2004.
[20]
P. Rousseeuw. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 1987.
[21]
S. Schaeffer. Survey: Graph clustering. Computer Science Review, 2007.
[22]
J. Shi and J. Malik. Normalized cuts and image segmentation. TPAMI, 2000.
[23]
P. Sneath and R. Sokal. Numerical Taxonomy. The Principles and Practice of Numerical Classification. 1973.
[24]
Y. Sun, C. Aggarwal, and J. Han. Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. VLDB, 2012.
[25]
A. Tung, R. Ng, L. Lakshmanan, and J. Han. Constraint-based clustering in large databases. In ICDT, 2001.
[26]
J. Ward. Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association, 1963.
[27]
J. Yang and J. Leskovec. Overlapping community detection at scale: A nonnegative matrix factorization approach. In WSDM, 2013.
[28]
S. Yarosh, T. Matthews, and M. Zhou. Asking the right person: Supporting expertise selection in the enterprise. In CHI, 2012.
[29]
J. Zhang, J. Chen, S. Zhi, Y. Chang, P. Yu, and J. Han. Link prediction across aligned networks with sparse low rank matrix estimation. In ICDE, 2017.
[30]
J. Zhang, Y. Lv, and P. Yu. Friend recommendation in enterprise social networks. In CIKM, 2015.
[31]
J. Zhang and P. Yu. Community detection for emerging networks. In SDM, 2015.
[32]
J. Zhang and P. Yu. Mcd: Mutual clustering across multiple heterogeneous networks. In IEEE BigData Congress, 2015.
[33]
J. Zhang and P. Yu. Multiple anonymized social networks alignment. In ICDM, 2015.
[34]
J. Zhang, P. Yu, and Y. Lv. Organizational chart inference. In KDD, 2015.
[35]
J Zhang, P. Yu, and Y. Lv. Enterprise community detection. In ICDE, 2017.
[36]
J. Zhang, P. Yu, and Y. Lv. Enterprise employee training via project team formation. In WSDM, 2017.
[37]
J. Zhang, P. Yu, Y. Lv, and Q. Zhan. Information diffusion at workplace. In CIKM, 2016.
[38]
J. Zhang, P. Yu, and Z. Zhou. Meta-path based multi-network collective link prediction. In KDD, 2014.
[39]
Y. Zhou and L. Liu. Social influence based clustering of heterogeneous information networks. In KDD, 2013.
[40]
J. Zhu, J. Zhang, L. He, Q. Wu, Y. Jia, B. Zhou, C. Zhang, and P. Yu. Broad learning based multi-source collaborative recommendation. In CIKM, 2017.

Cited By

View all
  • (2025)Multi Self-Supervised Pre-Finetuned Transformer Fusion for Better Vehicle DetectionIEEE Transactions on Automation Science and Engineering10.1109/TASE.2024.337475922(2075-2089)Online publication date: 2025
  • (2020)On Extracting Socially Tenuous Groups for Online Social Networks with k-TrianglesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.3025911(1-1)Online publication date: 2020
  • (2019)Online User Representation Learning Across Heterogeneous Social NetworksProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331258(545-554)Online publication date: 18-Jul-2019
  • Show More Cited By

Index Terms

  1. BL-ECD: Broad Learning based Enterprise Community Detection via Hierarchical Structure Fusion

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
    November 2017
    2604 pages
    ISBN:9781450349185
    DOI:10.1145/3132847
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 November 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. aligned social networks
    2. broad learning
    3. community detection
    4. enterprise social networks
    5. heterogeneous information network
    6. network embedding

    Qualifiers

    • Research-article

    Conference

    CIKM '17
    Sponsor:

    Acceptance Rates

    CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Multi Self-Supervised Pre-Finetuned Transformer Fusion for Better Vehicle DetectionIEEE Transactions on Automation Science and Engineering10.1109/TASE.2024.337475922(2075-2089)Online publication date: 2025
    • (2020)On Extracting Socially Tenuous Groups for Online Social Networks with k-TrianglesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.3025911(1-1)Online publication date: 2020
    • (2019)Online User Representation Learning Across Heterogeneous Social NetworksProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331258(545-554)Online publication date: 18-Jul-2019
    • (2019)A Survey on Food ComputingACM Computing Surveys10.1145/332916852:5(1-36)Online publication date: 13-Sep-2019
    • (2019)Research on the discovery of opinion leaders in social networksIOP Conference Series: Materials Science and Engineering10.1088/1757-899X/563/3/032009563(032009)Online publication date: 9-Aug-2019
    • (2019)A robust multi-view clustering method for community detection combining link and content informationPhysica A: Statistical Mechanics and its Applications10.1016/j.physa.2018.09.086514(396-411)Online publication date: Jan-2019
    • (2019)Link Prediction on Dynamic Heterogeneous Information NetworksComputational Data and Social Networks10.1007/978-3-030-34980-6_36(339-350)Online publication date: 11-Nov-2019
    • (2018)Broad Learning:ACM SIGKDD Explorations Newsletter10.1145/3229329.322933320:1(24-50)Online publication date: 29-May-2018
    • (2018)Advanced Analytics of Large Connected Data Based on Similarity ModelingSimilarity Search and Applications10.1007/978-3-030-02224-2_16(209-216)Online publication date: 4-Oct-2018
    • (2017)BL-MNE: Emerging Heterogeneous Social Network Embedding Through Broad Learning with Aligned Autoencoder2017 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM.2017.70(605-614)Online publication date: Nov-2017

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media