Skip to main content

A Load Balancing Knapsack Algorithm for Parallel Fuzzy c-Means Cluster Analysis

  • Conference paper
Book cover High Performance Computing for Computational Science - VECPAR 2008 (VECPAR 2008)

Abstract

This work proposes a load balance algorithm to parallel processing based on a variation of the classical knapsack problem. The problem considers the distribution of a set of partitions, defined by the number of clusters, over a set of processors attempting to achieve a minimal overall processing cost.

The work is an optimization for the parallel fuzzy c-means (FCM) clustering analysis algorithm proposed in a previous work composed by two distinct parts: the cluster analysis, properly said, using the FCM algorithm to calculate of clusters centers and the PBM index to evaluate partitions, and the load balance, which is modeled by the multiple knapsack problem and implemented through a heuristic that incorporates the restrictions related to cluster analysis in order to gives more efficiency to the parallel process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Boutsinas, B., Gnardellis, T.: On distributing the clustering process. Pattern Recognition Letters 23, 999–1008 (2002)

    Article  MATH  Google Scholar 

  2. Rahimi, S., Zargham, M., Thakre, A., Chhillar, D.: A parallel Fuzzy C-Mean algorithm for image segmentation. In: Proceedings of the IEEE Annual Meeting of the Fuzzy Information NAFIPS 2004, vol. 1, pp. 234–237 (2004)

    Google Scholar 

  3. Modenesi, M.V., Costa, M.A., Evsukoff, A.G., Ebecken, N.F.F.: Parallel Fuzzy c-Means Cluster Analysis. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds.) VECPAR 2006. LNCS, vol. 4395, pp. 139–142. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  4. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)

    Article  Google Scholar 

  5. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)

    Book  MATH  Google Scholar 

  6. Xie, X.L., Beni, G.A.: Validity measure for fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 3(8), 841–846 (1991)

    Article  Google Scholar 

  7. Bezdek, J., Pal, N.R.: Some new indexes of cluster validity. IEEE Trans. Systems Man and Cybernetics B 28, 301–315 (1998)

    Article  Google Scholar 

  8. Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recognition 37, 487–501 (2004)

    Article  MATH  Google Scholar 

  9. Dantzig, G.B.: Discrete Variable Extremum Problems. Operations Research 5, 266–277 (1957)

    Article  MathSciNet  Google Scholar 

  10. Mitten, L.G.: Branch-And-Bound Methods: General Formulation and Properties. Operations Research 18(1), 24–34 (1970)

    Article  MathSciNet  MATH  Google Scholar 

  11. Martello, S., Toth, P.: Knapsack Problems, Algorithms and Computer Implementations. John Wiley & Sons, Chichester (1990)

    MATH  Google Scholar 

  12. Pissinger, D., Toth, P.: Knapsak problems. In: Du, D.-Z., Pardalos, P. (eds.) Handbook of Combinatorial Optimization, vol. 1. Kluwer Academic Publishers, Dordrecht (1998)

    Google Scholar 

  13. Johnson, D.: Near-Optimal Bin Packing Algorithms, Doctoral Thesis. MIT, Cambridge (1973)

    Google Scholar 

  14. Graham, R.L.: Bounds on multiprocessor timing anomalies. SIAM J. Appl. Math. 17, 416–429 (1966)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Modenesi, M.V., Evsukoff, A.G., Costa, M.C.A. (2008). A Load Balancing Knapsack Algorithm for Parallel Fuzzy c-Means Cluster Analysis. In: Palma, J.M.L.M., Amestoy, P.R., Daydé, M., Mattoso, M., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2008. VECPAR 2008. Lecture Notes in Computer Science, vol 5336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92859-1_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-92859-1_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-92858-4

  • Online ISBN: 978-3-540-92859-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics