Skip to main content

Medical Data Clustering Based on Multi-objective Clustering Algorithm

  • Conference paper
  • First Online:
Machine Learning for Cyber Security (ML4CS 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13657))

Included in the following conference series:

  • 553 Accesses

Abstract

With the development of massive medical data, clustering algorithm becomes an effective way for medical data processing and data mining. On the one hand, it helps medical learners find effective information patterns from massive data; on the other hand, it promotes the development of medical technology and increase productivity. For traditional clustering algorithm, a single clustering index is difficult to meet people's needs of diversity and comprehensiveness. In contrast, multi-objective clustering (MOC) considers multiple objectives at the same time, and comprehensively deals with various clustering problems and standards, such as compactness, diversity of feature selection and high data dimension. Artificial bee colony algorithm (ABC) has a faster speed and embodies the idea of swarm intelligence. It imitates the optimization process of bees, and finally obtains the global optimal value. On this basis, this paper proposed a multi-objective artificial bee colony clustering algorithm (MOC-NABC) that is combined with current better-performed clustering algorithm. It takes normalized mutual information (NMI), Calinski-Harabasz (CH), Fowlkes-Mallows index (FMI) and silhouette coefficient (SC) of clustering as the final evaluation indexes. The experiment on UCI mouse protein gene dataset shows that the overall performance effect is greatly improved, e.g. compact clustering and the effective utilization of data features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Office of the State Council: Guiding Opinions of the General Office of the State Council on Promoting and Regulating the Development of Health Medical Person Data Application (2021)

    Google Scholar 

  2. Ji, P., Zhu, D., Xie, Y.X.: Reflections on the application of scientific research sharing of health and medical data. Medicine and Philosophy 43(1), 5–8 (2022)

    Google Scholar 

  3. Andreopoulos, B., An, A., Wang, X., Schroeder, M.: A roadmap of clustering algorithms: Finding a match for a biomedical application. Briefings in Bioinformatics 10(3), 297–314 (2009)

    Google Scholar 

  4. Karaboga, D., Basturk, B.: Artificial bee colony (ABC) optimization algorithm for solving constrained optimization problems. Springer, International fuzzy systems association world congress (2007)

    Book  MATH  Google Scholar 

  5. Hancer, E.: A new multi-objective differential evolution approach for simultaneous clustering and feature selection. Eng. Appl. Artif. Intell. 87, 103307 (2020)

    Article  Google Scholar 

  6. Kuo, R.J., Zulvia, F.E.: Multi-objective cluster analysis using a gradient evolution algorithm. Soft. Comput. 24(15), 11545–11559 (2020)

    Article  Google Scholar 

  7. Dutta, D., Sil, J., Dutta, P.: Automatic clustering by multi-objective genetic algorithm with numeric and categorical features. Expert Syst. Appl. 137, 357–379 (2019)

    Article  Google Scholar 

  8. Day, W.H.E., Edelsbrunner, H.: Efficient algorithms for agglomerative hierarchical clustering methods. J. Classif. 1(1), 7–24 (1984)

    Article  MATH  Google Scholar 

  9. MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. Oakland, CA, USA (1967)

    Google Scholar 

  10. Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. ACM SIGMOD Rec. 25(2), 103–114 (1996)

    Article  Google Scholar 

  11. Reynolds, D.A.: Gaussian mixture model. Encyclopedia of biometrics 41, 659–663 (2009)

    Article  Google Scholar 

  12. Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall Inc, Upper Saddle River, NJ, USA (1988)

    MATH  Google Scholar 

  13. Rai, P., Singh, S.: A survey of clustering techniques. International Journal of Computer Applications 7(12), (2010)

    Google Scholar 

  14. Higuera, C., Gardiner, K.J., Cios, K.J.: Self-organizing feature maps identify proteins critical to learning in a mouse model of down syndrome. PLoS ONE 10(6), e0129126 (2015)

    Article  Google Scholar 

  15. Majhi, R., Panda, G., Majhi, B., Sahoo, G.: Efficient prediction of stock market indices using adaptive bacterial foraging optimization (ABFO) and BFO based techniques. Expert Syst. Appl. 36(6), 10097–10104 (2009)

    Article  Google Scholar 

  16. Zhao, L., Yang, Y.: PSO-based single multiplicative neuron model for time series prediction. Expert Syst. Appl. 36(2), 2805–2812 (2009)

    Article  Google Scholar 

  17. Kang, H.I.: A fuzzy time series prediction method using the evolutionary algorithm. In International Conference on Intelligent Computing. 530–537. Springer, Berlin, Heidelberg (2005)

    Google Scholar 

Download references

Acknowledgement

This study is supported Natural Science Foundation of Guangdong (2020A1515010749, 2022A1515012077), Guangdong Province Innovation Team “Intelligent Management and Interdisciplinary Innovation” (2021WCXTD002), Shenzhen Higher Education Support Plan (20200826144104001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuqin He .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, S., Tan, Y., Guo, J., He, Y., Geng, S. (2023). Medical Data Clustering Based on Multi-objective Clustering Algorithm. In: Xu, Y., Yan, H., Teng, H., Cai, J., Li, J. (eds) Machine Learning for Cyber Security. ML4CS 2022. Lecture Notes in Computer Science, vol 13657. Springer, Cham. https://doi.org/10.1007/978-3-031-20102-8_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-20102-8_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20101-1

  • Online ISBN: 978-3-031-20102-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics