Skip to main content

An Adaptive Genetic Algorithm Approach for Optimizing Feature Weights in Multimodal Clustering

  • Conference paper
  • First Online:
Intelligent Computing (SAI 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1229))

Included in the following conference series:

  • 710 Accesses

Abstract

Social Media is a popular channel of communication, where people exchange different types of high volume and multimodal data. Cluster analysis is used to categorize this data to extract useful information. However, the variation of features that can be used in clustering makes the clustering process difficult, since some features may be more important than others, and some may be irrelevant or redundant. An alternative to traditional feature selection techniques, especially with the absence of domain knowledge, is to assign feature weights that depend on their importance in the clustering process. In this paper, we introduce a multimodal adaptive genetic clustering (MAGC) algorithm that clusters data according to multiple features. This is done by adding feature weights as an extension to the clustering solution. In other words, feature weights evolve and improve alongside the original clustering solution. In addition, the number of clusters is not determined a priori, but it is adapted and optimized during the evolutionary process as well. Our approach was tested on a large collection of Flickr images metadata and was found to perform better than a non-adaptive genetic algorithm clustering approach and to produce semantically related clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Nikolopoulos, S., Giannakidou, E., Kompatsiaris, I.: Combining multi-modal features for social media analysis. In: Hoi, S.C.H., Luo, J., Boll, S., Xu, D., Jin, R., King, I. (eds.) Social Media Modeling, pp. 71–96. Springer, London (2011).

    Chapter  Google Scholar 

  2. Hruschka, E., Campello, R., Freitas, A.A.: A survey of evolutionary algorithms for clustering. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 39, 133–155 (2009)

    Google Scholar 

  3. Hansen, P., Jaumard, B.: Cluster analysis and mathematical programming. Math. Program. 79, 191–215 (1997)

    MathSciNet  MATH  Google Scholar 

  4. Dhillon, I.S., Mallela, S., Modha, D.S.: Information-theoretic co-clustering. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD 2003, p. 89. ACM Press, New York (2003)

    Google Scholar 

  5. Becker, H., Naaman, M., Gravano, L.: Event identification in social media. In: 12th International Workshop on the Web and Databases (WebDB), Rhode Island, USA (2009)

    Google Scholar 

  6. Sheng, W., Liu, X.: A hybrid algorithm for k-medoid clustering of large data sets. In: Evolutionary Computation. CEC2004. IEEE (2004)

    Google Scholar 

  7. Liu, Y., Zheng, F., Cai, K., Jiang, B.: Cross-media retrieval method based on temporal-spatial clustering and multimodal fusion. In: 2009 Fourth International Conference on Internet Computing for Science and Engineering, pp. 78–84. IEEE (2009)

    Google Scholar 

  8. Sizov, S.: GeoFolk: latent spatial semantics in web 2.0 social media. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining - WSDM 2010, p. 281. ACM Press, New York (2010)

    Google Scholar 

  9. Olivares, X., Ciaramita, M., van Zwol, R.: Boosting image retrieval through aggregating search results based on visual annotations. In: Proceeding of the 16th ACM international conference on Multimedia - MM 2008. p. 189. ACM Press, New York (2008)

    Google Scholar 

  10. Aurnhammer, M., Hanappe, P., Steels, L.: Augmenting navigation for collaborative tagging with emergent semantics. In: International Semantic Web Conference (ISWC), pp. 58–71. Springer, Heidelberg (2006)

    Google Scholar 

  11. Wu, F., Pai, H.-T., Yan, Y.-F., Chuang, J.: Clustering results of image searches by annotations and visual features. Telemat. Inform. 31, 477–491 (2014)

    Article  Google Scholar 

  12. Zhuang, Y., Chiu, D.K.W., Jiang, N., Jiang, G., Wu, Z.: Personalized clustering for social image search results based on integration of multiple features. In: Zhou, S., Zhang, S., Karypis, G. (eds.) Advanced Data Mining and Applications, pp. 78–90. Springer, Heidelberg (2012)

    Google Scholar 

  13. Chatzilari, E., Nikolopoulos, S., Patras, I.: Enhancing computer vision using the collective intelligence of social media. In: New Directions in Web Data Management 1, pp. 235–271. Springer, Heidelberg (2011)

    Google Scholar 

  14. Giannakidou, E., Kompatsiaris, I.: SEMSOC: semantic, social and content-based clustering in multimedia collaborative tagging systems. In: 2008 IEEE International Conference on Semantic Computing (2008)

    Google Scholar 

  15. Lienhart, R., Romberg, S., Hörster, E.: Multilayer pLSA for multimodal image retrieval. In: Proceedings of the ACM International Conference on Image and Video Retrieval, p. 9 (2009)

    Google Scholar 

  16. Falkenauer, E.: Genetic Algorithms and Grouping Problems. Wiley, Hoboken (1998)

    MATH  Google Scholar 

  17. Lu, Y., Lu, S., Fotouhi, F., Deng, Y., Brown, S.: FGKA: a fast genetic k-means clustering algorithm. In: Proceedings of the 2004 ACM Symposium on Applied Computing, pp. 622–623 (2004)

    Google Scholar 

  18. Ma, P., Chan, K., Yao, X., Chiu, D.K.: An evolutionary clustering algorithm for gene expression microarray data analysis. IEEE Trans. Evol. Comput. 10, 296–314 (2006)

    Article  Google Scholar 

  19. Alhenak, L., Hosny, M.: Genetic-frog-leaping algorithm for text document clustering. Comput. Mater. Contin. 61, 1045–1074 (2019)

    Article  Google Scholar 

  20. Hosny, M.I., Hinti, L.A., Al-Malak, S.: A co-evolutionary framework for adaptive multidimensional data clustering. Intell. Data Anal. 22, 77–101 (2018)

    Article  Google Scholar 

  21. Al-malak, S., Hosny, M.: A multimodal adaptive genetic clustering algorithm. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2016), pp. 1453–1454, Denver, Colorado. ACM (2016)

    Google Scholar 

  22. Dorigo, M.: Optimization, learning and natural algorithms, Ph.D. thesis. Politecnico di Milano, Italy (1992)

    Google Scholar 

  23. Piatrik, T., Izquierdo, E.: Subspace clustering of images using ant colony optimisation. In: 2009 16th IEEE International Conference on Image Processing (ICIP), pp. 229–232. IEEE (2009)

    Google Scholar 

  24. Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addion Wesley, Boston (1989)

    MATH  Google Scholar 

  25. Mardia, K.V., Kent, J.T., Bibby, J.M.: Multivariate analysis. Analysis 97, 1–4 (1979)

    MATH  Google Scholar 

  26. Goldberg, D.E., Deb, K.: A comparative analysis of selection schemes used in genetic algorithms. Found. Genet. Algorithms 1, 69–93 (1991)

    MathSciNet  Google Scholar 

  27. De Jong, K.A.: An Analysis of the Behavior of a Class of Genetic Adaptive Systems, Ph.D. thesis. University of Michigan, USA (1975)

    Google Scholar 

  28. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-1, 224–227 (1979)

    Google Scholar 

  29. Petrovic, S.: A comparison between the silhouette index and the davies-bouldin index in labelling ids clusters. In: Proceedings of the 11th Nordic Workshop of Secure IT Systems, pp. 53–64 (2006)

    Google Scholar 

  30. Bolettieri, P., Esuli, A., Falchi, F., Lucchese, C., Perego, R., Piccioli, T., Rabitti, F.: CoPhIR: a test collection for content-based image retrieval. CoRR abs/0905.4627 (2009)

    Google Scholar 

  31. JAXB Reference Implementation — Project Kenai

    Google Scholar 

  32. Apache Lucene 5.3.1 Documentation

    Google Scholar 

  33. Lin, H., Yang, F., Kao, Y.: An efficient GA-based clustering technique. Tamkang J. Sci. 8, 113–122 (2005)

    Google Scholar 

  34. The Watchmaker Framework for Evolutionary Computation (evolutionary/genetic algorithms for Java)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manar Hosny .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hosny, M., Al-Malak, S. (2020). An Adaptive Genetic Algorithm Approach for Optimizing Feature Weights in Multimodal Clustering. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2020. Advances in Intelligent Systems and Computing, vol 1229. Springer, Cham. https://doi.org/10.1007/978-3-030-52246-9_13

Download citation

Publish with us

Policies and ethics