Abstract
This work proposes a load balance algorithm to parallel processing based on a variation of the classical knapsack problem. The problem considers the distribution of a set of partitions, defined by the number of clusters, over a set of processors attempting to achieve a minimal overall processing cost.
The work is an optimization for the parallel fuzzy c-means (FCM) clustering analysis algorithm proposed in a previous work composed by two distinct parts: the cluster analysis, properly said, using the FCM algorithm to calculate of clusters centers and the PBM index to evaluate partitions, and the load balance, which is modeled by the multiple knapsack problem and implemented through a heuristic that incorporates the restrictions related to cluster analysis in order to gives more efficiency to the parallel process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Boutsinas, B., Gnardellis, T.: On distributing the clustering process. Pattern Recognition Letters 23, 999–1008 (2002)
Rahimi, S., Zargham, M., Thakre, A., Chhillar, D.: A parallel Fuzzy C-Mean algorithm for image segmentation. In: Proceedings of the IEEE Annual Meeting of the Fuzzy Information NAFIPS 2004, vol. 1, pp. 234–237 (2004)
Modenesi, M.V., Costa, M.A., Evsukoff, A.G., Ebecken, N.F.F.: Parallel Fuzzy c-Means Cluster Analysis. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á.L.G.A., Pacitti, E., Lopes, J.C. (eds.) VECPAR 2006. LNCS, vol. 4395, pp. 139–142. Springer, Heidelberg (2007)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)
Xie, X.L., Beni, G.A.: Validity measure for fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 3(8), 841–846 (1991)
Bezdek, J., Pal, N.R.: Some new indexes of cluster validity. IEEE Trans. Systems Man and Cybernetics B 28, 301–315 (1998)
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recognition 37, 487–501 (2004)
Dantzig, G.B.: Discrete Variable Extremum Problems. Operations Research 5, 266–277 (1957)
Mitten, L.G.: Branch-And-Bound Methods: General Formulation and Properties. Operations Research 18(1), 24–34 (1970)
Martello, S., Toth, P.: Knapsack Problems, Algorithms and Computer Implementations. John Wiley & Sons, Chichester (1990)
Pissinger, D., Toth, P.: Knapsak problems. In: Du, D.-Z., Pardalos, P. (eds.) Handbook of Combinatorial Optimization, vol. 1. Kluwer Academic Publishers, Dordrecht (1998)
Johnson, D.: Near-Optimal Bin Packing Algorithms, Doctoral Thesis. MIT, Cambridge (1973)
Graham, R.L.: Bounds on multiprocessor timing anomalies. SIAM J. Appl. Math. 17, 416–429 (1966)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Modenesi, M.V., Evsukoff, A.G., Costa, M.C.A. (2008). A Load Balancing Knapsack Algorithm for Parallel Fuzzy c-Means Cluster Analysis. In: Palma, J.M.L.M., Amestoy, P.R., Daydé, M., Mattoso, M., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2008. VECPAR 2008. Lecture Notes in Computer Science, vol 5336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92859-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-92859-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92858-4
Online ISBN: 978-3-540-92859-1
eBook Packages: Computer ScienceComputer Science (R0)