Skip to main content

Graph Clustering

  • Reference work entry
  • First Online:
Encyclopedia of Machine Learning and Data Mining
  • 124 Accesses

Synonyms

Minimum cuts; Network clustering; Spectral clustering; Structured data clustering

Definition

Graph clustering refers to clustering of data in the form of graphs. Two distinct forms of clustering can be performed on graph data. Vertex clustering seeks to cluster the nodes of the graph into groups of densely connected regions based on either edge weights or edge distances. The second form of graph clustering treats the graphs as the objects to be clustered and clusters these objects on the basis of similarity. The second approach is often encountered in the context of structured or XML data.

Motivation and Background

Graph clustering is a form of graph mining that is useful in a number of practical applications including marketing, customer segmentation, congestion detection, facility location, and XML data integration (Lee et al. 2002). The graph clustering problems are typically defined into two categories:

  • Node clustering algorithms: Node clustering algorithms are...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 699.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 949.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  • Abello J, Resende MG, Sudarsky S (2002) Massive quasi-clique detection. In: Proceedings of the 5th Latin American symposium on theoretical informatics (LATIN). Springer, Berlin, pp 598–612

    Google Scholar 

  • Aggarwal C, Ta N, Feng J, Wang J, Zaki MJ (2007) XProj: a framework for projected structural clustering of XML documents. In: KDD conference, San Jose, pp 46–55

    Google Scholar 

  • Ahuja R, Orlin J, Magnanti T (1992) Network flows: theory, algorithms, and applications. Prentice-Hall, Englewood Cliffs

    MATH  Google Scholar 

  • Chawathe SS (1999) Comparing hierachical data in external memory. In: Very large data bases conference. Morgan Kaufmann, San Francisco, pp 90–101

    Google Scholar 

  • Chung F (1997) Spectral graph theory. Conference Board of the Mathematical Sciences, Washington, DC

    MATH  Google Scholar 

  • Dalamagas T, Cheng T, Winkel K, Sellis T (2005) Clustering XML documents using structural summaries. In: Information systems. Elsevier, Jan 2005

    Google Scholar 

  • Gibson D, Kumar R, Tomkins A (2005) Discovering large dense subgraphs in massive graphs. In: VLDB conference, pp 721–732. http://www.vldb2005.org/program/paper/thu/p721-gibson.pdf

  • Jain A, Dubes R (1998) Algorithms for clustering data. Prentice-Hall, Englewood

    MATH  Google Scholar 

  • Kernighan BW, Lin S (1970) An efficient heuristic procedure for partitioning graphs. Bell Syst Tech J 49:291–307

    Article  MATH  Google Scholar 

  • Lee M, Hsu W, Yang L, Yang X (2002) XClust: clustering XML schemas for effective integration. In: ACM conference on information and knowledge management. http://doi.acm.org/10.1145/584792.584841

  • Lian W, Cheung DW, Mamoulis N, Yiu S (2004) An efficient and scalable algorithm for clustering XML documents by structure. IEEE Trans Knowl Data Eng 16(1):82–96

    Article  Google Scholar 

  • Pei J, Jiang D, Zhang A (2005) On mining cross-graph quasi-cliques. In: ACM KDD conference, Chicago

    Chapter  Google Scholar 

  • Rattigan M, Maier M, Jensen D (2007) Graph clustering with network structure indices. In: Proceedings of the international conference on machine learning. ACM, New York, pp 783–790

    Google Scholar 

  • Tsay AA, Lovejoy WS, Karger DR (1999) Random sampling in cut, flow, and network design problems. Math Oper Res 24(2):383–413

    Article  MathSciNet  Google Scholar 

  • Zeng Z, Wang J, Zhou L, Karypis G (2007) Out-of-core coherent closed quasi-clique mining from large dense graph databases. ACM Trans Database Syst 32(2):13

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media New York

About this entry

Cite this entry

Aggarwal, C.C. (2017). Graph Clustering. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_348

Download citation

Publish with us

Policies and ethics