skip to main content
10.1145/1014052.1014146acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Mining scale-free networks using geodesic clustering

Published: 22 August 2004 Publication History

Abstract

Many real-world graphs have been shown to be scale-free---vertex degrees follow power law distributions, vertices tend to cluster, and the average length of all shortest paths is small. We present a new model for understanding scale-free networks based on multilevel geodesic approximation, using a new data structure called a multilevel mesh.Using this multilevel framework, we propose a new kind of graph clustering for data reduction of very large graph systems such as social, biological, or electronic networks. Finally, we apply our algorithms to real-world social networks and protein interaction graphs to show that they can reveal knowledge embedded in underlying graph structures. We also demonstrate how our data structures can be used to quickly answer approximate distance and shortest path queries on scale-free networks.

References

[1]
A. L. Barabasi, R. Albert, H. Jeong, and G. Bianconi. Power-law distribution of the world wide web. Science, 287, 2000.
[2]
B. Bollobas. Random Graphs. Cambridge University Press, second edition, 2001.
[3]
S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the Seventh International Conference on World Wide Web 7, pages 107--117. Elsevier Science Publishers B. V., 1998.
[4]
P. Domingos and M. Richardson. Mining the network value of customers. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 57--66. ACM Press, 2001.
[5]
Z. Drezner and H. W. Hamacher. Facility Location: Applications and Theory. Springer, 2002.
[6]
M. Garland. Multiresolution modeling: Survey & future opportunities. In State of the Art Report, pages 111--131. Eurographics, Sept. 1999.
[7]
L. Holder, D. Cook, and S. Djoko. Substructure discovery in the subdue system. In Proceedings of the Workshop on Knowledge Discovery in Databases, pages 169--180, 1994.
[8]
H. Jeong, S. P. Mason, A. L. Barabasi, and Z. Oltvai. Lethality and centrality in protein networks. In Nature, volume 411, pages 41--42, 2001.
[9]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. J. ACM, 46(5):604--632, 1999.
[10]
V. Krebs. Mapping networks of terrorist cells. Connections, 24, 2001.
[11]
B. Mirkin. Mathematical Classification and Clustering. Kluwer Academic Publishers, 1996.
[12]
M. E. J. Newman. The structure and function of complex networks. In SIAM Review, volume 45, pages 167--256, 2003.
[13]
M. E. J. Newman. Detecting community structure in networks. In Eur. Phys. J. B., 2004.
[14]
J. O'Madadhain, D. Fisher, S. White, and Y. Boey. The JUNG (Java Universal Network/Graph) framework. Technical report, UC Irvine, 2003.
[15]
M. Richardson and P. Domingos. Mining knowledge-sharing sites for viral marketing. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 61--70. ACM Press, 2002.
[16]
K. Schloegel, G. Karypis, and V. Kumar. Graph partitioning for high performance scientific simulations. In CRPC Parallel Computing Handbook. Morgan Kaufmann, 2000.
[17]
T. Washio and H. Motoda. State of the art of graph-based data mining. SIGKDD Explor. Newsl., 5(1):59--68, 2003.
[18]
S. Wasserman and K. Faust. Social network analysis. Cambridge University Press, Cambridge, 1994.
[19]
S. White and P. Smyth. Algorithms for estimating relative importance in networks. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 266--275. ACM Press, 2003.
[20]
X. Yan and J. Han. gSpan: Graph-based substructure pattern mining. In Proc. 2002 Int. Conf. on Data Mining (ICDM'02), 2002.
[21]
T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH: an efficient data clustering method for very large databases. In ACM SIGMOD Intl. Conf. on Management of Data, pages 103--114, June 1996.

Cited By

View all
  • (2024)Scaling law of real traffic jams under varying travel demandEPJ Data Science10.1140/epjds/s13688-024-00471-413:1Online publication date: 11-Apr-2024
  • (2021)Detection of Topology Changes in Dynamical System: An Information Theoretic ApproachCellular Automata10.1007/978-3-030-69480-7_4(26-35)Online publication date: 13-Feb-2021
  • (2020)WLeidenRDF: RDF Data Query Method based on Semantic-Enhanced Graph-Clustering Algorithm2020 International Symposium on Theoretical Aspects of Software Engineering (TASE)10.1109/TASE49443.2020.00014(33-40)Online publication date: Dec-2020
  • Show More Cited By

Index Terms

  1. Mining scale-free networks using geodesic clustering

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2004
    874 pages
    ISBN:1581138881
    DOI:10.1145/1014052
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 August 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clustering
    2. graphs
    3. scale-free networks
    4. social networks

    Qualifiers

    • Article

    Conference

    KDD04

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Scaling law of real traffic jams under varying travel demandEPJ Data Science10.1140/epjds/s13688-024-00471-413:1Online publication date: 11-Apr-2024
    • (2021)Detection of Topology Changes in Dynamical System: An Information Theoretic ApproachCellular Automata10.1007/978-3-030-69480-7_4(26-35)Online publication date: 13-Feb-2021
    • (2020)WLeidenRDF: RDF Data Query Method based on Semantic-Enhanced Graph-Clustering Algorithm2020 International Symposium on Theoretical Aspects of Software Engineering (TASE)10.1109/TASE49443.2020.00014(33-40)Online publication date: Dec-2020
    • (2019)An improved RDF data Clustering AlgorithmProcedia Computer Science10.1016/j.procs.2019.01.038148(208-217)Online publication date: 2019
    • (2019)A review and proposal of (fuzzy) clustering for nonlinearly separable dataInternational Journal of Approximate Reasoning10.1016/j.ijar.2019.09.004Online publication date: Sep-2019
    • (2018)A Multi-Granularity Backbone Network Extraction Method Based on the Topology PotentialComplexity10.1155/2018/86041322018(1-8)Online publication date: 22-Oct-2018
    • (2017)Graph Summarization Based on Attribute-Connected NetworkWeb and Big Data10.1007/978-3-319-69781-9_16(161-171)Online publication date: 8-Nov-2017
    • (2016)Social community detection based on node distance and interestProceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies10.1145/3006299.3006342(274-289)Online publication date: 6-Dec-2016
    • (2016)On detecting communities in social networks with interests2016 12th International Conference on Innovations in Information Technology (IIT)10.1109/INNOVATIONS.2016.7880053(1-5)Online publication date: Nov-2016
    • (2016)Summarizing scale-free networks based on virtual and real linksPhysica A: Statistical Mechanics and its Applications10.1016/j.physa.2015.08.048444(360-372)Online publication date: Feb-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media