Skip to main content

On Fuzzy Clustering for Categorical Multivariate Data Induced by Polya Mixture Models

  • Conference paper
  • First Online:
Modeling Decisions for Artificial Intelligence (MDAI 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10571))

Abstract

In this paper, three fuzzy clustering models for categorical multivariate data are proposed based on the Polya mixture model and q-divergence. A conventional fuzzy clustering model for categorical multivariate data is constructed by fuzzifying a multinomial mixture model (MMM) via regularizing Kullback-Leibler (KL) divergence appearing in a pseudo likelihood of an MMM, whereas MMM is extended to a Polya mixture model (PMM) and no fuzzy counterpart to PMM is proposed. The first proposed model is constructed by fuzzifying PMM, by means of regularizing KL-divergence appearing in a pseudo likelihood of the model. The other two models are derived by modifying the first proposed algorithm, which is based on the fact that one of the three fuzzy clustering models for vectorial data is similar to the first proposed model, and that another fuzzy clustering model for vectorial data can connect the other two fuzzy clustering models for vectorial data based on q-divergence. In numerical experiments, the properties of the membership of the proposed methods were observed using an artificial dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)

    Google Scholar 

  2. Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)

    Book  MATH  Google Scholar 

  3. Miyamoto, S., Mukaidono, M.: Fuzzy \(c\)-means as a regularization and maximum entropy approach. In: Proceedings of 7th International Fuzzy Systems Association World Congress (IFSA 1997), vol. 2, pp. 86–92 (1997)

    Google Scholar 

  4. Miyamoto, S., Kurosawa, N.: Controlling cluster volume sizes in fuzzy \(c\)-means clustering. In: Proceedings of SCIS&ISIS2004, pp. 1–4 (2004)

    Google Scholar 

  5. Ichihashi, H., Honda, K., Tani, N.: Gaussian mixture PDF approximation and fuzzy \(c\)-means clustering with entropy regularization. In: Proceedings of 4th Asian Fuzzy System Symposium, pp. 217–221 (2000)

    Google Scholar 

  6. Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for Fuzzy Clustering. Springer, Heidelberg (2008)

    Google Scholar 

  7. Rigouste, L., Cappé, O., Yvon, F.: Inference and evaluation of the multinomial mixture model for text clustering. Inf. Process. Manag. 43(5), 1260–1280 (2007)

    Article  Google Scholar 

  8. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Royal Stat. Soc. Ser. B 39, 1–38 (1977)

    Google Scholar 

  9. Honda, K., Oshio, S., Notsu, A.: FCM-type fuzzy co-clustering by K-L information regularization. In: Proceedings of FUZZ-IEEE2014, pp. 2505–2510 (2014)

    Google Scholar 

  10. Madsen, R.E., Kauchak, D., Elkan, C.: Modeling word burstiness using the Dirichlet distribution. In: Proceedings of ICML, pp. 545–552 (2005)

    Google Scholar 

  11. Chernoff, H.: A measure of asymptotic efficiency for tests of a hypothesis based on a sum of observations. Ann. Math. Stat. 23, 493–507 (1952)

    Google Scholar 

  12. Tsallis, C.: Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 52, 479–487 (1988)

    Google Scholar 

  13. Menard, M., Courboulay, V., Dardignac, P.: Possibilistic and probabilistic fuzzy clustering: unification within the framework of the non-extensive thermostatistics. Pattern Recogn. 36, 1325–1342 (2003)

    Google Scholar 

  14. Miyamoto, S., Umayahara, K.: Methods in hard and fuzzy clustering. In: Liu, Z.-Q., Miyamoto, S. (eds.) Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)

    Google Scholar 

  15. Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. 1, 98–110 (1993)

    Google Scholar 

  16. Kanzawa, Y.: On possibilistic clustering methods based on Shannon/Tsallis-entropy for spherical data and categorical multivariate data. In: Torra, V., Narukawa, Y. (eds.) MDAI 2015. LNCS, vol. 9321, pp. 115–128. Springer, Cham (2015). doi:10.1007/978-3-319-23240-9_10

    Chapter  Google Scholar 

  17. Kanzawa, Y.: On possibilistic clustering algorithms based on noise clustering. In: Proceedings of SCIS&ISIS2016, pp. 42–47 (2016)

    Google Scholar 

  18. Kanzawa, Y.: Generalization of quadratic regularized and standard fuzzy c-means clustering with respect to regularization of hard c-means. In: Torra, V., Narukawa, Y., Navarro-Arribas, G., Megías, D. (eds.) MDAI 2013. LNCS, vol. 8234, pp. 152–165. Springer, Heidelberg (2013). doi:10.1007/978-3-642-41550-0_14

    Chapter  Google Scholar 

  19. Kanzawa, Y.: Power-regularized fuzzy c-means clustering with a fuzzification parameter less than one. JACIII 20(4), 561–570 (2016)

    Article  Google Scholar 

Download references

Acknowledgment

This work was supported by JSPS KAKENHI Grant Number 15K00348.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuchi Kanzawa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Kanzawa, Y. (2017). On Fuzzy Clustering for Categorical Multivariate Data Induced by Polya Mixture Models. In: Torra, V., Narukawa, Y., Honda, A., Inoue, S. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2017. Lecture Notes in Computer Science(), vol 10571. Springer, Cham. https://doi.org/10.1007/978-3-319-67422-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67422-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67421-6

  • Online ISBN: 978-3-319-67422-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics