Skip to main content

Parallel Peer Pressure Clustering Algorithm Based on Linear Algebra Computation

  • Conference paper
  • First Online:
Advanced Parallel Processing Technologies (APPT 2017)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10561))

Included in the following conference series:

  • 844 Accesses

Abstract

Graph clustering is pervasive in emerging “big data” applications, and is known to be quite challenging to implement on distributed memory systems. In this work, we design and implement scalable distributed-memory algorithms for peer pressure clustering using the sparse matrix infrastructures of Combinatorial BLAS, where the peer pressure clustering algorithm is represent as sparse matrix computations. For settling ties, which is the most time-consuming step in this algorithm, we design a matrix-based algorithm and provide two parallel implementations. One is based on MPI model, and the other is a hybrid programming with MPI and OpenMP. Relying on matrix algebra building blocks, our algorithm exposes a high degree of parallelism and good scalability on distributed-memory platforms. For a real instance, when the input is a permuted R-MAT graph of scale 21 with self-loops added, our MPI implementation achieves up to 809.6x speedup on 1024 cores of a Dawning supercomputer, and the hybrid implementation with MPI and OpenMP obtains 949.5x speedup on 2048 cores of the same computer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Shah, V.B.: An interactive system for combinatorial scientific computing with an emphasis on programmer productivity, Dissertation. University of California, Santa Barbara (2007)

    Google Scholar 

  2. Techentin, R.W., et al.: Implementing iterative algorithms with SPARQL. Edbt/icdt workshops 2014, pp. 216–223 (2014)

    Google Scholar 

  3. Deweese, K., et al.: Graph clustering in SPARQL. In: SIAM Workshop on Network Science, vol. 34, pp. 930–941 (2013)

    Google Scholar 

  4. Gilbert, J.R., Reinhardt, S., Shah, V.B.: High-performance graph algorithms from parallel sparse matrices. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds.) PARA 2006. LNCS, vol. 4699, pp. 260–269. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75755-9_32

    Chapter  Google Scholar 

  5. Harary, F.: Graph Theory. Addison-Wesley, Boston (1969)

    MATH  Google Scholar 

  6. Buluç, A., Gilbert, J.R.: Highly parallel sparse matrix-matrix multiplication. Computer Science, abs/1006.2183 (2010)

    Google Scholar 

  7. Buluç, A., Gilbert, J.R.: Parallel sparse matrix-matrix multiplication and indexing: Implementation and experiments. SIAM J. Sci. Comput. 34(4), C170–C191 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  8. Buluç, A., Gilbert, J.R.: Challenges and advances in parallel sparse matrix-matrix multiplication. In: 37th International Conference on Parallel Processing, ICPP 2008, pp. 503–510. IEEE (2008)

    Google Scholar 

  9. Buluç, A., Gilbert, J.R.: On the representation and multiplication of hypersparse matrices. In: IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2008, pp. 1–11. IEEE (2008)

    Google Scholar 

  10. Buluç, A., Gilbert, J.R.: The combinatorial BLAS: design, implementation, and applications. Int. J. High Perform. Comput. Appl. 25(4), 496–509 (2011)

    Article  Google Scholar 

  11. Chakrabarti, D., Zhan, Y., Faloutsos, C.: R-MAT: a recursive model for graph mining. In: SDM, vol. 4, pp. 442–446 (2004)

    Google Scholar 

  12. Jeremy, K., Gilbert, J. (eds.): Graph Algorithms in the Language of Linear Algebra, vol. 22. SIAM, Philadelphia (2011)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jun Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Chen, J., Zou, P. (2017). Parallel Peer Pressure Clustering Algorithm Based on Linear Algebra Computation. In: Dou, Y., Lin, H., Sun, G., Wu, J., Heras, D., Bougé, L. (eds) Advanced Parallel Processing Technologies. APPT 2017. Lecture Notes in Computer Science(), vol 10561. Springer, Cham. https://doi.org/10.1007/978-3-319-67952-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67952-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67951-8

  • Online ISBN: 978-3-319-67952-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics