Abstract
In this paper, three fuzzy clustering models for categorical multivariate data are proposed based on the Polya mixture model and q-divergence. A conventional fuzzy clustering model for categorical multivariate data is constructed by fuzzifying a multinomial mixture model (MMM) via regularizing Kullback-Leibler (KL) divergence appearing in a pseudo likelihood of an MMM, whereas MMM is extended to a Polya mixture model (PMM) and no fuzzy counterpart to PMM is proposed. The first proposed model is constructed by fuzzifying PMM, by means of regularizing KL-divergence appearing in a pseudo likelihood of the model. The other two models are derived by modifying the first proposed algorithm, which is based on the fact that one of the three fuzzy clustering models for vectorial data is similar to the first proposed model, and that another fuzzy clustering model for vectorial data can connect the other two fuzzy clustering models for vectorial data based on q-divergence. In numerical experiments, the properties of the membership of the proposed methods were observed using an artificial dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)
Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Miyamoto, S., Mukaidono, M.: Fuzzy \(c\)-means as a regularization and maximum entropy approach. In: Proceedings of 7th International Fuzzy Systems Association World Congress (IFSA 1997), vol. 2, pp. 86–92 (1997)
Miyamoto, S., Kurosawa, N.: Controlling cluster volume sizes in fuzzy \(c\)-means clustering. In: Proceedings of SCIS&ISIS2004, pp. 1–4 (2004)
Ichihashi, H., Honda, K., Tani, N.: Gaussian mixture PDF approximation and fuzzy \(c\)-means clustering with entropy regularization. In: Proceedings of 4th Asian Fuzzy System Symposium, pp. 217–221 (2000)
Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for Fuzzy Clustering. Springer, Heidelberg (2008)
Rigouste, L., Cappé, O., Yvon, F.: Inference and evaluation of the multinomial mixture model for text clustering. Inf. Process. Manag. 43(5), 1260–1280 (2007)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Royal Stat. Soc. Ser. B 39, 1–38 (1977)
Honda, K., Oshio, S., Notsu, A.: FCM-type fuzzy co-clustering by K-L information regularization. In: Proceedings of FUZZ-IEEE2014, pp. 2505–2510 (2014)
Madsen, R.E., Kauchak, D., Elkan, C.: Modeling word burstiness using the Dirichlet distribution. In: Proceedings of ICML, pp. 545–552 (2005)
Chernoff, H.: A measure of asymptotic efficiency for tests of a hypothesis based on a sum of observations. Ann. Math. Stat. 23, 493–507 (1952)
Tsallis, C.: Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 52, 479–487 (1988)
Menard, M., Courboulay, V., Dardignac, P.: Possibilistic and probabilistic fuzzy clustering: unification within the framework of the non-extensive thermostatistics. Pattern Recogn. 36, 1325–1342 (2003)
Miyamoto, S., Umayahara, K.: Methods in hard and fuzzy clustering. In: Liu, Z.-Q., Miyamoto, S. (eds.) Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)
Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. 1, 98–110 (1993)
Kanzawa, Y.: On possibilistic clustering methods based on Shannon/Tsallis-entropy for spherical data and categorical multivariate data. In: Torra, V., Narukawa, Y. (eds.) MDAI 2015. LNCS, vol. 9321, pp. 115–128. Springer, Cham (2015). doi:10.1007/978-3-319-23240-9_10
Kanzawa, Y.: On possibilistic clustering algorithms based on noise clustering. In: Proceedings of SCIS&ISIS2016, pp. 42–47 (2016)
Kanzawa, Y.: Generalization of quadratic regularized and standard fuzzy c-means clustering with respect to regularization of hard c-means. In: Torra, V., Narukawa, Y., Navarro-Arribas, G., Megías, D. (eds.) MDAI 2013. LNCS, vol. 8234, pp. 152–165. Springer, Heidelberg (2013). doi:10.1007/978-3-642-41550-0_14
Kanzawa, Y.: Power-regularized fuzzy c-means clustering with a fuzzification parameter less than one. JACIII 20(4), 561–570 (2016)
Acknowledgment
This work was supported by JSPS KAKENHI Grant Number 15K00348.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Kanzawa, Y. (2017). On Fuzzy Clustering for Categorical Multivariate Data Induced by Polya Mixture Models. In: Torra, V., Narukawa, Y., Honda, A., Inoue, S. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2017. Lecture Notes in Computer Science(), vol 10571. Springer, Cham. https://doi.org/10.1007/978-3-319-67422-3_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-67422-3_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67421-6
Online ISBN: 978-3-319-67422-3
eBook Packages: Computer ScienceComputer Science (R0)