skip to main content
10.1145/3587828.3587829acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Improvement for Large-Scale Image Data using Fuzzy Rough C-Mean Based Unsupervised CNN Clustering: An Empirical Study on designbyhumans.com

Authors Info & Claims
Published:20 June 2023Publication History

ABSTRACT

Abstract: Clustering analysis, specifically for extensive image data, is increasingly being applied in various fields such as finance, risk management, prediction, etc., and has been a fascinating subject in many scientific discussions. Deep learning, a widely used approach, and classical methods address complex classification problems stemming from real-world cases. In this study, we took various approaches to classification problems and measured their effectiveness by combining different techniques using the results of different scenarios. Many approaches have been proposed to solve the clustering problem; complex clustering methods such as hierarchical, density-based, centroid-based, and graph theoretical have been submitted. However, when it comes to real-world applications, they exposed significant drawbacks when the dataset introduced immeasurable vagueness, uncertainty, or overlapping samples that made it impossible to predict and classify. Several attempts have been made to improve the clustering method's performance, including joint CNN clustering models. Still, many of them carry the cons of the complicated clustering method, which limits the capability of CNN. The combined CNN clustering method is designed to address the problem with those deterministic CNN clustering models and was evaluated on a dataset we collected from the website designbyhumans.com, with enough features to represent a non-synthetic dataset. This research aims to improve upon the established model by using estimation techniques in determining model parameters and graphing plots to justify those choices and give insights into how the model performs on a non-synthetic dataset like ours. We concluded that the model significantly improved compared with a popular complex clustering method, which has been evaluated by computational time, using different metrics to represent how better separated each cluster was. Based on conducted experiments and the future development of the method, we discussed and addressed some of the drawbacks of this approach.

References

  1. P. Zdzisław, "Rough set theory and its applications", Journal of Telecommunications and Information Technology, vol. 3, pp. 7-10, 2002.Google ScholarGoogle Scholar
  2. Zimmermann, H.-J. (2010), Fuzzy set theory. WIREs Comp Stat, 2: 317-332. https://doi.org/10.1002/wics.82Google ScholarGoogle ScholarCross RefCross Ref
  3. James C. Bezdek, Robert Ehrlich, William Full, FCM: The fuzzy c-means clustering algorithm,Computers & Geosciences,Volume 10, Issues 2–3, 1984,Pages 191-203,ISSN 0098-3004,https://doi.org/10.1016/0098-3004(84)90020-7.Google ScholarGoogle ScholarCross RefCross Ref
  4. Ubukata, S., Notsu, A. and Honda, K., 2017. General formulation of rough C-means clustering. International Journal of Computer Science and Network Security, 17(9), pp.29-38.Google ScholarGoogle Scholar
  5. H. Qinghua and Y. Daren, "An Improved Clustering Algorithm for Information Granulation", 2005.Google ScholarGoogle Scholar
  6. Hinton, G.E., 2009. Deep belief networks. Scholarpedia, 4(5), p.5947.Google ScholarGoogle Scholar
  7. Salakhutdinov, R. and Larochelle, H., 2010, March. Efficient learning of deep Boltzmann machines. In Proceedings of the thirteenth international conference on artificial intelligence and statistics (pp. 693-700). JMLR Workshop and Conference Proceedings.Google ScholarGoogle Scholar
  8. Zhou, C. and Paffenroth, R.C., 2017, August. Anomaly detection with robust deep autoencoders. In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 665-674).Google ScholarGoogle Scholar
  9. O'Shea, K. and Nash, R., 2015. An introduction to convolutional neural networks. arXiv preprint arXiv:1511.08458.Google ScholarGoogle Scholar
  10. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1 (NIPS'12). Curran Associates Inc., Red Hook, NY, USA, 1097–1105.Google ScholarGoogle Scholar
  11. He, K., Zhang, X., Ren, S. and Sun, J., 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).Google ScholarGoogle Scholar
  12. Simonyan, K. and Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.Google ScholarGoogle Scholar
  13. Long, J., Shelhamer, E. and Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3431-3440).Google ScholarGoogle Scholar
  14. Hsu, C.C. and Lin, C.W., 2017. Cnn-based joint clustering and representation learning with feature drift compensation for large-scale image data. IEEE Transactions on Multimedia, 20(2), pp.421-429.Google ScholarGoogle Scholar
  15. Riaz, S., Arshad, A. and Jiao, L., 2018. Fuzzy rough C-mean based unsupervised CNN clustering for large-scale image data. Applied Sciences, 8(10), p.1869.Google ScholarGoogle Scholar
  16. Designbyhumans.comGoogle ScholarGoogle Scholar

Index Terms

  1. Improvement for Large-Scale Image Data using Fuzzy Rough C-Mean Based Unsupervised CNN Clustering: An Empirical Study on designbyhumans.com
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ICSCA '23: Proceedings of the 2023 12th International Conference on Software and Computer Applications
            February 2023
            385 pages
            ISBN:9781450398589
            DOI:10.1145/3587828

            Copyright © 2023 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 20 June 2023

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)40
            • Downloads (Last 6 weeks)3

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format